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FIGURE 1 



CCAATCGCCCGGTGCGGTGGTGCAGGGTCTCGGGCTAGTCATGGCGTCCCCGTCTCGGAGAC 
TG CAGACTAAACCAGTCATTACTTGTTTC AAGAG CGTT CTGCTAAT CT ACACTTTTATTTTC 
TGGATCACTGGCGTTATCCTTCTTGCAGTTGGCATTTGGGGCAAGGTGAGCCTGGAGAATTA 
CTTTTCTCTTTTAAATGAGAAGGCCACCAATGTCCCCTTCGTGCTCATTGCTACTGGTACCG 
TCATTATTCTTTTGGGCACCTTTGGTTGTTTTGCTACCTGCCGAGCTTCTGCATGGATGCTA 
AAACTGTATGCAATGTTTCTGACTCTCGTTTTTTTGGTCGAACTGGTCGCTGCCATCGTAGG 
ATTTGTTTTCAGACATGAGATTAAGAACAGCTTTAAGAATAATTATGAGAAGGCTTTGAAGC 
AGTATAACTCTACAGGAGATTATAGAAGCCATGCAGTAGACAAGATCCAAAATACGTTGCAT 
TGTTGTGG TGTCAC CG AT TATAGAG ATT GG ACAGATACTAATTATTACTCAG AAAAAG GATT 
TC CT AAG AGTTG CTGT AAACTTGAAGATTGTACT C CACAGAGAGATGC AG AC AAAGTAAAC A 
ATGAAGGTTGTTTTATAAAGGTGATGACCATTATAGAGTCAGAAATGGGAGTCGTTGCAGGA 
ATTTCCTTTGGAGTTGCTTGCTTCCAACTGATTGGAATCTTTCTCGCCTACTGCCWCTCTCG 
TGCCATAACAAATAACCAGTATGAGATAGTGTAACCCAATGTATCTGTGGGCCTATTCCTCT 
CTACCTTTAAGGACATTTAGGGTCCCCCCTGTGAATTAGAAAGTTGCTTGGCTGGAGAACTG 
ACAACACTACTTACTGATAGACCAAAAAACTACACCAGTAGGTTGATTCAATCAAGATGTAT 
GTAGACCTAAAACTACACCAATAGGCTGATTCAATCAAGATCCGTGCTCGCAGTGGGCTGAT 
T CAAT CAAGATGTATGTTTG CT ATGT T C T AAGT C CACCTT CT AT C C CATT CATGTT AG ATCG 
TTGAAACCCTGTATCCCTCTGAAACACTGGAAGAGCTAGTAAATTGTAAATGAAGT 
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FIGURE 2 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA19902 
xsubunit 1 of 1, 245 aa, 1 stop, 1 unknown 
><MW: -1, pl : 8.36, NX(S/T): 1 

MAS PSRRLQTKP VI TCFKSVLL I YTF I FW I TGVI LLAVG I WGKVSLENYFS LLNEKATNVPF 
VLIATGTVI I LLGTFGCFATCRAS AWMLKLYAMFLTLVFLVELVAAI VGFVFRHE I KNSFKN 
NYEKALKQYNSTGDYRSHAVDKIQNTLHCCGVTDYRDWTDTNYYSEK-GFPKSCCKLEDCTPQ 
RDADKVNNEGCFIKVMTIIESEMGWAGISFGVACFQLIGIFLAYCXSRAITNNQYEIV 

Important features of the protein: 
Signal peptide: 

amino acids 1-4 2 

Transmembrane domains : 

amino acids 19-42, 61-83, 92-114, 209-230, 

N-glycosylation site. 

amino acids 134-138 

Tyrosine kinase phosphorylation site. 

amino acids 160-168, 160-169 

N-myristoylation site. 

amino acids 75-81, 78-84, 210-216, 214-220, 226-232 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 69-80, 211-222 
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FIGURE 3 



CCCACGCGTCCGGCGCCGTGGCCTCGCGTCCATCTTTGCCGTTCTCTCGGACCTGTCACAAA 
GGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGGAGGTAGAGAAAGTCAGT 
GCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAACGGGAGGGAGTCTGAGGGT 
TGGGGACGTCTGTGAGGGAGGGGAACAGCCGCTCGAGCCTGGGGCGGGCGGACCGGACTGGG 
GCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGGCGTTGGTCAGAACCTGAGAAACA 
GC CG AG AGGTTT TCCACCGAGG C C CG CG CTTGAGGG ATCTG AAGAGGTT CCTAGAAG AGGGT 
GTTCCCTCTTTCGGGGGTCCTCACCAGAAGAGGTTCTTGGGGGTCGCCCTTCTGAGGAGGCT 
GCGGCTAACAGGGCCCAGAACTGCCATTGGATGTCCAGAATCCCCTGTAGTTGATAATGTTG 
GGAATAAGCTCTGCAACTTTCTTTGGCATTCAGTTGTTAAAAACAAATAGGATGCAAATTCC 
TCAACTCCAGGTTATGAAAACAGTACTTGGAAAACTGAAAACTACCTAAATGATCGTCTTTG 
GTTGGGCCGTGTTCTTAGCGAGCAGAAGCCTTGGCCAGGGTCTGTTGTTGACTCTCGAAGAG 
CACATAGCCCACTTCCTAGGGACTGGAGGTGCCGCTACTACCATGGGTAATTCCTGTATCTG 
CCGAG ATG ACAGTGGAACAGATG ACAGTGTTG AC AC CC AACAGCAAC AGG CCGAGAAC AGTG 
CAGTACCCACTGCTGACACAAGGAGCCAACCACGGGACCCTGTTCGGCCACCAAGGAGGGGC 
CGAGGAC CT CATGAGCCAAGGAGAAAGAAACAAAATGTGGATGGG CTAGT GTTGGACACACT 
GGCAGTAATACGGACTCTTGTAGATAAGTAAGTATCTGACTCACGGTCACCTCCAGTGGAAT 
GAAAAGTGTTCTGCCCGGAACCATGACTTTAGGACTCCTTCAGTTCCTTTAGGACATACTCG 
CCAAGCCTTGTGCT CACAGGGCAAAGGAGAATATTTTAATGCTCCGCTGATGGCAGAGTAAA 
TGATAAGATTTGATGTTTTTGCTTGCTGTCATCTACTTTGTCTGGAAATGTCTAAATGTTTC 
TGTAG CAGAAAACACG AT AAAGCTATGAT CTTTATT AGAG 
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FIGURE 4 



MIVFGWAVFLASRSLGQGLLLTLEEHIAHFLGTGGAATTMGNSCICRDDSGTDDSVDTQQQQ 
AENSAVPTADTRSQP.RDPVRPPRRGRGPHEPRRKKQNVDGLVLDTLAVIRTLVDKO 

Signal peptide: 

amino acids 1-16 

Casein kinase II phosphorylation site. 

amino acids 22-26, 50-54, 113-117 

N-myristoylation site. 

amino acids 18-24, 32-38, 34-40, 35-41, 51-57 
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FIGURE 5 



GGCACGAGGCGCTGTCCACCCGGGGGCGTGGGAGTGAGGTACCAGATTCAGCCCATTTGGCC 
CCGACGCCTCTGTTCTCGGAATCCGGGTGCTGCGGATTGAGGTCCCGGTTCCTAACGGACTG 
CAAGATGGAGGAAGGCGGGAACCTAGGAGGCCTGATTAAGATGGTCCATCTACTGGTCTTGT 
CAGGTGCCTGGGGCATGCAAATGTGGGTGACCTTCGTCTCAGGCTTCCTGCTTTTCCGAAGC 
CTTCCCCGACATACCTTCGGACTAGTGCAGAGCAAACTCTTCCCCTTCTACTTCCACATCTC 
CATGGGCTGTGCCTTCATCAACCTCTGCATCTTGGCTTCACAGCATGCTTGGGCTCAGCTCA 
CATTCTGGGAGGCCAGCCAGCTTTACCTGCTGTTCCTGAGCCTTACGCTGGCCACTGTCAAC 
GCCCGCTGGCTGGAACCCCGCACCACAGCTGCCATGTGGGCCCTGCAAACCGTGGAGAAGGA 
GCGAGGCCTGGGTGGGGAGGTACCAGGCAGCCACCAGGGTCCCGATCCCTACCGCCAGCTGC 
GAGAGAAGGACCCCAAGTACAGTGCTCTCCGCCAGAATTTCTTCCGCTACCATGGGCTGTCC 
TCTCTTTGCAATCTGGGCTGCGTCCTGAGCAATGGGCTCTGTCTCGCTGGCCTTGCCCTGGA 
AATAAGG AG C CT CTAGC ATGGG CCCTGCATGC TAATAAATGCTT CTTCAGAAATGAAAAAAA 
AAAAAAAAAAAA 
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FIGURE 6 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56107 
<subunit l of 1, 231 aa, 1 stop 
<NX(S/T) : 0 

MEEGGNLGGLIK^HLLVLSGAWGMQMWVTFVSGFLLFRSLPRHTFGLVQSKLFPFYFHISM 
GCAF INLC I LASQHAWAQLTFWEASQLYLLFLSLTLAT VNARWLEPRTTAAMWALQTVEKER 
GLGGE VPGSHQGPDP YRQLREKD PKYS ALRQNFFRYHGLS S LCNLGCVLSNGLCIiAGLALE I RS L 

Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 86-103, 60-75 

Casein kinase II phosphorylation site. 

amino acids 82-86 

Tyrosine kinase phosphorylation site. 

amino acids 144-151 

N-myristoylation site . 

amino acids 4-10, 5-11, 47-53, 170-176, 176-182 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 54-65 

G-protein coupled receptors proteins . 

amino acids 44-85 
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AATT C AGATTTTAAGC C CATT CTGC AGTGG AATTTC ATG AACT AGC AAGAGG AC ACC AT C TT 
CTTGTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGT 
GCTAGGCCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAG 
ACATCACTGATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCC 
AGAACTTTTGATAAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAAC 
AGCTTTAAAGGCAGAAACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAG 
AGAATGTCAAGAGGACTGCCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGT 
CTGATCAATAATGCTGGTGTTCCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGA 
CTACAGAGAACCTATTGAAGTGAACCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTC 
CTTTGGTCAAGAAAGCTCAAGGGAGAGTTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCA 
ATCGTTGGAGGGGGCTATACTCCATCCAAATATGCAGTGGAAGGTTTCAATGACAGCTTAAG 
ACGGGACATGAAAGCTTTTGGTGTGCACGTCTCATGCATTGAACCAGGATTGTTCAAAACAA 
AC TTGG C AGAT CC AGTAAAGGTAATTGAAAAAAAACTCGC CATTTGGG AG C AGCTGT CT C C A 
GACATCAAACAACAATATGGAGAAGGTTACATTGAAAAAAGTCTAGACAAACTGAAAGGCAA 
TAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTGCATGGACCACGCTCTAACAA 
GTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAATTTTCTGGATACCTCTG 
TCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAAGCAGAGCTGGCTAA 
TCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGAAATTGGCCGAT 
TTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACTCATTTAGA 
TCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGTCCCTG 
CTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCTGT 
ATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATG 
ATCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTT 
AAGT AT CAT CTCTT AT CTAAATATTAAAAGATAAGTCAAC CC AAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA5 64 06 
xsubunit 1 of 1, 319 aa, 1 stop 
><MW: 35227, pi: 8.97, NX(S/T): 3 

MLFWVLGLL I LCGFLWTRKGKLKI ED I TDKY I F I TGCDSGFGNLAARTFDKKGFHVI AACLT 
ESGSTALKAETSERLRTVLLDVTDPENVKJITAQWVK^QVGEKGLWGLINNAGVPGVIJVPTDW 
LTLE DYRE P I EVNL FGL I S VTLNMLP LVKKAQGR VI NVS S VGGRLA I-VGGGYTPS KYAVEGF 
NDSLRRDMKAFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLD 
KLKGNKSYVNMDLSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQK 
AELANPKAV 

Important features of the protein: 
Signal peptide : 

amino acids 1-17 

Transmembrane domain : 

amino acids 13 6-152 

N-glycosylation sites - 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 3 9-42 

N-myristoylation sites* 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 9 



GCGGGCTGTTGACGGCGCTGCGATGGCTGCCTGCGAGGGCAGGAGAAGCGGAGCTCTCGGTT 

CCTCTCAGTCGGACTTCCTGACGCCGCCAGTGGGCGGGGCCCCTTGGGCCGTCGCCACCACT 

GTAGTCATGTACCCACCGCCGCCGCCGCCGCCTCATCGGGACTTCATCTCGGTGACGCTGAG 

CTTTGGCGAGAGCTATGACAACAGCAAGAGTTGGCGGCGGCGCTCGTGCTGGAGGAAATGGA 

AGCAACTGTCGAGATTGCAGCGGAATATGATTCTCTTCCTCCTTGCCTTTCTGCTTTTCTGT 

GGACTCCTCTTCTACATCAACTTGGCTGACCATTGGAAAGCTCTGGCTTTCAGGCTAGAGGA 

AGAGCAGAAGATGAGGCCAGAAATTGCTGGGTTAAAACCAGCAAATCCACCCGTCTTACCAG 

CTCCTCAG AAGG CG GACACCGAC C CTGAGAACTT AC CTGAGATTT CGTC ACAGAAGACACAA 

AGAC AC AT CCAGCGGGGACC AC CT C ACCTG CAGATT AGAC CC CCAAGC CAAGAC CTGAAGGA 

TGGGACCCAGGAGGAGGCCACAAAAAGGCAAGAAGCCCCTGTGGATCCCCGCCCGGAAGGAG 

ATCCGC AGAGGACAGT CAT C AG CTGG AGGGGAGCGGTGATCGAGC CTGAGC AGGGC AC CG AG 

CTCCCTTCAAGAAGAGCAGAAGTGCCCACCAAGCCTCCCCTGCCACCGGCCAGGACACAGGG 

CACACCAGTGCATCTGAACTATCGCCAGAAGGGCGTGATTGACGTCTTCCTGCATGCATGGA 

AAGGATACCGCAAGTTTGCATGGGGCCATGACGAGCTGAAGCCTGTGTCCAGGTCCTTCAGT 

GAGTGGTTTGGCCTCGGTCTCACACTGATCGACGCGCTGGACACCATGTGGATCTTGGGTCT 

GAGGAAAGAATTTGAGGAAGCCAGGAAGTGGGTGTCGAAGAAGTTACACTTTGAAAAGGACG 

TGGACGTCAACCTGTTTGAGAGCACGATCCGCATCCTGGGGGGGCTCCTGAGTGCCTACCAC 

CTGTCTGGGGACAGCCTCTTCCTGAGGAAAGCTGAGGATTTTGGAAATCGGCTAATGCCTGC 

CTTCAGAACACCATCCAAGATTCCTTACTCGGATGTGAACATCGGTACTGGAGTTGCCCACC 

CGCCACGGTGGACCTCCGACAGCACTGTGGCCGAGGTGACCAGCATTCAGCTGGAGTTCCGG 

GAGCTCTCCCGTCTCACAGGGGATAAGAAGTTTCAGGAGGCAGTGGAGAAGGTGACACAGCA 

CATCCACGGCCTGTCTGGGAAGAAGGATGGGCTGGTGCCCATGTTCATCAATACCCACAGTG 

GCCTCTTCACCCACCTGGGCGTATTCACGCTGGGCGCCAGGGCCGACAGCTACTATGAGTAC 

CTGCTGAAGC AGTGGATC CAGGGCGGGAAGCAGGAGACACAG CTGCTGGAAGACTACGTGGA 

AGCCATCGAGGGTGTCAGAACGCACCTGCTGCGGCACTCCGAGCCCAGTAAGCTCACCTTTG 

TGGGGGAGCTTGCCCACGGCCGCTTCAGTGCCAAGATGGACCACCTGGTGTGCTTCCTGCCA 

GGGACGCTGGCTCTGGGCGTCTACCACGGCCTGCCCGCCAGCCACATGGAGCTGGCCCAGGA 

GCTCATGG AGACTT GT TACC AG ATGAACCGGCAGATGGAGACGGGGCTGAGT CC CGAGAT CG 

TGCACTTCAACCTTTACCCCCAGCCGGGCCGTCGGGACGTGGAGGTCAAGCCAGCAGACAGG 

CACAACCTGCTGCGGCCAGAGACCGTGGAGAGCCTGTTCTACCTGTACCGCGTCACAGGGGA 

CCGCAAATACCAGGACTGGGGCTGGGAGATTCTGCAGAGCTTCAGCCGATTCACACGGGTCC 

CCTCGGGTGGCTATTCTTCCATCAACAATGTCCAGGATCCTCAGAAGCCCGAGCCTAGGGAC 

AAGATGGAGAGCTTCTTCCTGGGGGAGACGCTCAAGTATCTGTTCTTGCTCTTCTCCGATGA 

CCCAAACCTGCTCAGCCTGGACGCCTACGTGTTCAACACCGAAGCCCACCCTCTGCCTATCT 

GGACCCCTGCCTAGGGTGGATGGCTGCTGGTGTGGGGACTTCGGGTGGGCAGAGGCACCTTG 

CTGGGTCTGTGGCATTTTCCAAGGGCCCACGTAGCACCGGCAACCGCCAAGTGGCCCAGGCT 

CTGAACTGGCTCTGGGCTCCTCCTCGTCTCTGCTTTAATCAGGACACCGTGAGGACAAGTGA 

GGCCGTCAGTCTTGGTGTGATGCGGGGTGGGCTGGGCCGCTGGAGCCTCCGCCTGCTTCCTC 

CAGAAGACACGAATCATGACTCACGATTGCTGAAGC CTGAGC AGGTCTCTGTGGGCCGACCA 

GAGGGGGGCTTCGAGGTGGTCCCTGGTACTGGGGTGACCGAGTGGACAGCCCAGGGTGCAGC 

TCTGCCCGGGCTCGTGAAGCCTCAGATGTCCCCAATCCAAGGGTCTGGAGGGGCTGCCGTGA 

CTCCAGAGGCCTGAGGCTCCAGGGCTGGCTCTGGTGTTTACAAGCTGGACTCAGGGATCCTC 

CTGGCCGCCCCGCAGGGGGCTTGGAGGGCTGGACGGCAAGTCCGTCTAGCTCACGGGCCCCT 

CCAGTGGAATGGGTCTTTTCGGTGGAGATAAAAGTTGATTTGCTCTAACCGCAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56529 
xsubunit 1 of 1, 699 aa, 1 stop 
><MW: 79553, pi: 7.83, NX(S/T): 0 

MAACEGRRSGALGSSQSDFLTPPVGGAPWAVATTWMYPPPPPPPHRDFISVTLSFGESYDN 
SKSWRRRSCWRKWKQLSRLQRNMILFLLAFLLFCGLLFYINLADHWKAIiAFRLEEEQKMRPE 
IAGLKPANPPVLPAPQKADTDPENLPEISSQKTQRHIQRGPPHLQIRPPSQDLKDGTQEEAT 
KRQEAPVDPRPEGDPQRTVISWRGAVIEPEQGTELPSRRAEVPTKPPLPPARTQGTPVHLNY 
RQKGVIDVFLHAWKGYRKFAWGHDELKPVSRSFSEWFGLGLTLIDALDTMWILGLRKEFEEA 
RKWVSKKLHFEKDVDVNLFESTIRILGGLLSAYHLSGDSLFLRKAEDFGNRLMPAFRTPSKI 
PYSDVNIGTGVAHPPRWTSDSTVAEVTSIQLEFRELSRLTGDKKFQEAVEKVTQHIHGLSGK 
KDGLVPMFINTHSGLFTHLGVFTLGARADSYYEYLLKQWIQGGKQETQLLEDYVEAIEGVRT 
HLLRHSEPSKLTFVGELAHGRFSAKMDHLVCFLPGTI4ALGVYHGLPASHMELAQELMETCYQ 
MNRQMETGLSPEIVHFNLYPQPGRRDVEVKPADRHNLLRPETVESLFYLYRVTGDRKYQDWG 
WEILQSFSRFTRVPSGGYSSINNVQDPQKPEPRDKMESFFIiGETLKYIiFLLFSDDPNLLSLD 

AYVFNTEAHPLP IWTPA 

Important features of the protein: 
Transmembrane domain: 

amino acids 21-40 and 84-105 (type II) 
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FIGURE 11 



GGCGCCGCGTAGGCCCGGGAGGCCGGGCCGGCCGGGCTGCGAGCGCCTGCCCCATGCGCCGC 
CGCCTCTCCGCACGATGTTCCCCTCGCGGAGGAAAGCGGCGCAGCTGCCCTGGGAGGACGGC 
AGGTCCGGGTTGCTCTCCGGCGGCCTCCCTCGGAAGTGTTCCGTCTTCCACCTGTTCGTGGC 
CTGCCTCTCGCTGGGCTTCTTCTCCCTACTCTGGCTGCAGCTCAGCTGCTCTGGGGACGTGG 
CCCGGGCAGTCAGGGGACAAGGGCAGGAGACCTCGGGCCCTCCCCGTGCCTGCCCCCCAGAG 
CCGCCCCCTGAGCACTGGGAAGAAGACGCATCCTGGGGCCCCCACCGCCTGGCAGTGCTGGT 
GCCCTTCCGCGAACGCTTCGAGGAGCTCCTGGTCTTCGTGCCCCACATGCGCCGCTTCCTGA 
GCAGGAAGAAGATC CGGC AC C A CATCTACGTGCTCAACCAGGTGGACCACTTCAGGTTCAAC 
CGGGCAGCGCTCATCAACGTGGGCTTCCTGGAGAGCAGCAACAGCACGGACTACATTGCCAT 
GCACGACGTTGACCTGCTCCCTCTCAACGAGGAGCTGGACTATGGCTTTCCTGAGGCTGGGC 
CCTTCCACGTGGCCTCCCCGGAGCTCCACCCTCTCTACCACTACAAGACCTATGTCGGCGGC 
ATCCTGCTGCTCTCCAAGCAGCACTACCGGCTGTGCAATGGGATGTCCAACCGCTTCTGGGG 
CTGGGGCCGCGAGGACGACGAGTTCTACCGGCGCATTAAGGGAGCTGGGCTCCAGCTTTTCC 
GCCCCTCGGGAATCACAACTGGGTACAAGACATTTCGCCACCTGCATGACCCAGCCTGGCGG 
AAGAGGGACCAGAAGCGCATCGCAGCTCAAAAACAGGAGCAGTTCAAGGTGGACAGGGAGGG 
AGGCCTGAACACTGTGAAGTACCATGTGGCTTCCCGCACTGCCCTGTCTGTGGGCGGGGCCC 
CCTGCACTGTCCTCAACATCATGTTGGACTGTGACAAGACCGCCACACCCTGGTGCACATTC 
AGCTGAGC TGGATGGACAGTGAGGAAG C CTGTAC C TACAG GC CATATTGCTCAGGCTCAGGA 
CAAGGCCTCAGGTCGTGGGCCCAGCTCTGACAGGATGTGGAGTGGCCAGGACCAAGACAGCA 
AGCTACGCAATTGCAGCCACCCGGCCGCCAAGGCAGGCTTGGGCTGGGCCAGGACACGTGGG 
GTGCCTGGGACGCTGCTTGCCATGCACAGTGATCAGAGAGAGGCTGGGGTGTGTCCTGTCCG 
GGACCCCCCCTGCCTTCCTGCTCACCCTACTCTGACCTCCTTCACGTGCCCAGGCCTGTGGG 
TAGTGGGGAGGGCTGAACAGGACAACCTCTCATCACCCTACTCTGACCTCCTTCACGTGCCC 
AGGCCTGTGGGTAGTGGGGAGGGCTGAACAGGACAACCTCTCATCACCCCCAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA56531 
xsubunit 1 of 1, 327 aa, 1 stop 
><MW: 37406, pi: 9.30, NX(S/T): 1 

MFPSRRKAAQLPWEDGRSGLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARAVR 
GQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSRKKI 
RHHIYVLNQVDHFRFNRAALINVGFLESSNSTDYIAMHDVDLLPLNEELDYGFPEAGPFHVA 
S PELHP LYHYKT YVGG I LLL S KQH YRLCNGMSNRFWGWGREDDE F YRR I KGAGLQL FR PS G I 
TTGYKT FRHLHD P AWRKRDQ KR IAAQKQEQFKVDREGGLNTVKYHVASRTALS VGGAP CTVL 
NIMLDCDKTATPWCTFS 

Signal peptide: 

amino acids 1-42 

Transmembrane domain: 

amino acids 29-49 (type II) 

N-glycosylation site. 

amino acids 154-158 

CAMP- and cGMP- dependent protein kinase phosphorylation site* 

amino acids 27-31 

Tyrosine kinase phosphorylation site. 

amino acids 226-233 

N-myristoylation site . 

amino acids 19-25, 65-71, 247-253, 285-291, 303-309, 304-310 



• 




WO 00/12708 



PCT/US99/2011I 



FIGURE 13 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCT 
GCTGCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCT 
CTTCTGCCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTC 
CTGCCTCATCGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGT 
GGGGGATGGC TAA GAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGG 
GCTGCCTTATTTAAAGTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGGC 
CCTGTTCATTAAGAAATTGTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGT 
AAATATGTCTTTATAATAAACAGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA56862 
<subunit 1 of 1, 73 aa, 1 stop 
<MW: 7879, pi: 7.21, NX(S/T): 0 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQ 
PRGEGEKVGDG 

Important features : 
Signal peptide: 

amino acids 1-15 

Growth, factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 15 



GGGACCCATGCGGCCGTGACCCCCGGCTCCCTAGAGGCCCAGCGCAGCCGCAGCGGACAAAG 

GAGCATGTCCGCGCCGGGGAAGGCCCGTCCTCCGGCCGCCATAAGGCTCCGGTCGCCGCTGG 

GCCCGCGCCGCGCTCCTGCCCGCCCGGGCTCCGGGGCGGCCCGCTAGGCCAGTGCGCCGCCG 

CTCGCCCCGCAGGCCCCGGCCCGCAGCATGGAGCCACCCGGACGCCGGCGGGGCCGCGCGCA 

GCCGCCGCTGTTGCTGCCGCTCTCGCTGTTAGCGCTGCTCGCGCTGCTGGGAGGCGGCGGCG 

GCGGCGGCGCCGCGGCGCTGCCCGCCGGCTGCAAGCACGATGGGCGGCCCCGAGGGGCTGGC 

AGGGCGGCGGGCGCCGCCGAGGGCAAGGTGGTGTGCAGCAGCCTGGAACTCGCGCAGGTCCT 

GCCCCCAGATACTCTGCCCAACCGCACGGTCACCCTGATTCTGAGTAACAATAAGATATCCG 

AGCTGAAGAATGGCTCATTTTCTGGGTTAAGTCTCCTTGAAAGATTGGACCTCCGAAACAAT 

CTTATTAGTAGTATAGATCCAGGTGCCTTCTGGGGACTGTCATCTCTAAAAAGATTGGATCT 

GACAAACAATCGAATAGGATGTCTGAATGCAGACATATTTCGAGGACTCACCAATCTGGTTC 

GGCTAAACCTTTCGGGGAATTTGTTTTCTTCATTATCTCAAGGAACTTTTGATTATCTTGCG 

TCATTACGGT CTTTGGAATT CC AGACTGAGTATCTTTTGTGTGACTGTAACATACTGTGGAT 

GCATCGCTGGGTAAAGGAGAAGAACATCACGGTACGGGATACCAGGTGTGTTTATCCTAAGT 

CACTGCAGGCCCAACCAGTCACAGGCGTGAAGCAGGAGCTGTTGACATGCGACCCTCCGCTT 

GAATTGCCGTCTTTCTACATGACTCCATCTCATCGCCAAGTTGTGTTTGAAGGAGACAGCCT 

T C CTTT CC AGTG CATGGCTTC ATATATTGAT CAGGACATG CAAGTGTTGTGGTATC AGGATG 

GG AGAATAGTTG AAAC CGATGAATCGC AAGGT ATTT TT GT TG AAAAGAAC ATG ATTC ACAAC 

TGCTCCTTGATTGCAAGTGCCCTAACCATTTCTAATATTCAGGCTGGATCTACTGGAAATTG 

GGGCTGTC ATGT C C AGACCAAACGTGGGAAT AAT ACGAGG AC TGTGGATATTGTGGT ATTAG 

AGAGTTCTGCAC AGTACTGT C C T C CAGAGAGGGTGGTAAACAACAAAGGTG ACTTC AGATGG 

CCCAGAACATTGGCAGGCATTACTGCATATCTGCAGTGTACGCGGAACACCCATGGCAGTGG 

GATATAT CC CGG AAAC CC AC AGGATG AGAGAAAAGCTTGG CG CAGATG TGATAGAGGTGGCT 

TTTGGGC AGATG ATGATT ATTC TCG CTGT C AGT ATG C AAATG ATGT CACTAGAGTTCTTTAT 

ATGTTTAATCAGATGCCCCTCAATCTTACCAATGCCGTGGCAACAGCTCGACAGTTACTGGC 

TT ACACTGTGGAAG CAGC CAAC TTTT CTGACAAAATGGATGTTATATTTGTGGCAGAAATGA 

TTGAAAAATTTGGAAG AT TTAC CAAGGAGG AAAAATCAAAAGAGCTAGGTGACGTGATGGTT 

GACATTGCAAGTAACATCATGTTGGCTGATGAACGTGTCCTGTGGCTGGCGCAGAGGGAAGC 

TAAAGCCTGCAGTAGGATTGTGCAGTGTCTTCAGCGCATTGCTACCTACCGGCTAGCCGGTG 

GAGCTCACGTTT ATTC AACATATTCACCCAATATTGCTCTGGAAGCTT ATGT CATC AAGTCT 

ACTGGCTTCACGGGGATGACCTGTACCGTGTTCCAGAAAGTGGCAGCCTCTGATCGTACAGG 

ACTTT CGG AT TATGGGAGGCGGGATCC AGAGGGAAACC TGGATAAGCAGC TG AGCTTT AAGT 

GCAATGTTTCAAATACATTTTCGAGTCTGGCACTAAAGGTATGTTACATTCTGCAATCATTT 

AAGACTATTTACAGT TAAA TTAGAATGCTCCAAATGTTCTGCTTCGCAAAATAACCTTATTA 

AAAGATTTTTTTTTGCAGGAAGATAGGTATTATTGCTTTTGCTACTGTTTTAAAGAAAACTA 

ACCAGGAAGAACTGCATTACGACTTTCAAGGGCCCTAGGCATTTTTGCCTTTGATTCCCTTT 

CTTCACATAAAAATAT C AG AAATT AC ATTTT ATAACTGC AGTGGTATAAATG CAAAT ATACT 

ATTGTTACATGTGAAAAAATTTTATTTGACTTAAAAGTTTATTTATTTGTTTTTTTGCTCCT 

GATTTTAAGACAATAAG ATGTTTT CATGGG CC CCTAAAAGTAT CATGAG C CTTTGGCACTGC 

GCCTGCCAAGC CTAGTGGAG AAGT C AAC CCTGAGACCAGGTGTTTAAT C AAGCAAGCTGT AT 

ATCAAAATTTTTGG CAGAAAAC ACAAAT ATGT CAT ATATCTTTTTTTAAAAAAAGTATTTCA 

TTGAAG CAAGCAAAATGAAAGC ATTTTTAC TG ATTTTT AAAATTGGTGCTTTAGATATATTT 

GACTAC ACTG TATTGAAG C AAATAGAGGAGGC AC AAC T CC AGCACC CTAATGGAACCACATT 

TTTTTCACTTAGCTTTCTGTGGGCATGTGTAATTGTATTCTCTGCGGTTTTTAATCTCACAG 

TACTTT ATTT CTGTCTTGTC CCTCAATAATATCACAAACAATATTCCAGTCATTTTAATGGC 

TGCATAATAACTGATCCAACAGGTGTTAGGTGTTCTGGTTTAGTGTGAGCACTCAATAAATA 

TTGAATGAATGAACGAAAAAAAAAAAAAAAA 
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FIGURE 16 



MEPPGRRRGRAQPPLLLPLSLLALLALLGGGGGGGAAALPAGCKHDGRPRGAGRAAGAAEGK 
WCSSLELAQVLPPDTLPNRTVTLILSNNKISELKNGSFSGLSLLERLDLRNNLISSIDPGA 
FWGLSSLKRLDLTNNRIGCLNADIFRGLTNLVRLNLSGNLFSSLSQGTFDYLASLRSLEFQT 
EYLLCDCNILWMHRWVKEKNITVRDTRCVYPKSLQAQPVTGVKQELLTCDPPLELPSFYMTP 
SHRQWFEGDSLPFQCMASYIDQDMQVLWYQDGRIVETDESQGIFVEKNMIHNCSLIASALT 
I S NI QAGS TGNWGCHVQTKRGNNTRT VD I WLES S AQ YCPPERWNNKGDFRWPRTLAG I TA 
YLQCTRNTHGSGIYPGNPQDERKAWRRCDRGGFWADDDYSRCQYANDVTRVLYMFNQMPLNL 
TNAVATARQLLAYTVEAANFSDKMDVIFVAEMIEKFGRFTKEEKSKELGDVMVDIASNIMLA 
DERVLWLAQRE AKACS R I VQ CLQR I ATYRLAGGAHVYSTYS PN I ALEA YV I KS TGFTGMTCT 
VFQKVAASDRTGLSDYGRRDPEGNLDKQLSFKCNVSNTFSSLALKVCYILQSFKTIYS 

Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 13-40 (type II) 

N-glycosylation site. 

amino acids 81-85, 98-102, 159-163, 206-210, 301-305, 332-336, 
433-437, 453-457, 592-596 

N-myristoylation site. 

amino acids 29-35, 30-36, 31-37, 32-38, 33-39, 34-40, 51-57, 
57-63, 99-105, 123-129, 142-148, 162-168, 317-323, 320-326, 
384-390, 403-409, 554-560 
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FIGURE 17 



GCGTGGGGATGTCTAGGAGCTCGAAGGTGGTGCTGGGCCTCTCGGTGCTGCTGACGGCGGCC 
ACAGTGGCCGGCGTACATGTGAAGCAGCAGTGGGACCAGCAGAGGCTTCGTGACGGAGTTAT 
CAGAGACATTGAGAGGCAAATTCGGAAAAAAGAAAACATTCGTCTTTTGGGAGAACAGATTA 
TTTTGACTGAGCAACTTGAAG CAG AAAG AG AG AAGATGTT ATTGGC AAAAGG AT CT C AAAAA 
TCATGACTTGAATGTGAAATATCTGTTGGACAGACAACACGAGTTTGTGTGTGTGTGTTGAT 
GGAGAGTAGCTTAGTAGTATCTTCATCTTTTTTTTTGGTCACTGTCCTTTTAAACTTGATCA 
AATAAAGGACAGTGGGTCATATAAGTTACTGCTTTCAGGGTCCCTTATATCTGAATAAAGGA 
GTGTGGGCAGACACTTTTTGGAAGAGTCTGTCTGGGTGATCCTGGTAGAAGCCCCATTAGGG 
TCACTGTCCAGTGCTTAGGGTTGTTACTGAGAAGCACTGCCGAGCTTGTGAGAAGGAAGGGA 
TGGATAGT AG CATC CAC CTG AGTAGT CTGATC AG T CGG CATGATGACGAAG CCACGAGAAC A 
TCG AC CT CAGAAGGACTGGAGGAAGGTGAAGTGG AGGGAG AGACG CTCCTGATCGT CGAAT CC 
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FIGURE 18 

MSRS SKWLGLSVLLTAATVAGVHVKQQWDQQRLRDGVI RD I ERQ I RKKENI RLLGEQ 1 1 LT 
EQLEAEREKMLLAKGSQKS 

Signal peptide: 

amino acids 1-21 
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FIGURE 19 



CTGTCGTCTTTGCTTCAGCCGCAGTCGCCACTGGCTGCCTGAGGTGCTCTTACAGCCTGTTC 
CAAGTGTGGCTTAATCCGTCTCCACCACCAGATCTTTCTCCGTGGATTCCTCTGCTAAGACC 
GCTGCCATGCCAGTGACGGTAACCCGCACCACCATCACAACCACCACGACGTCATCTTCGGG 
CCTGGGGTCCCCCATGATCGTGGGGTCCCCTCGGGCCCTGACACAGCCCCTGGGTCTCCTTCGC 
CTGCTGCAGCTGGTGTCTACCTGCGTGGCCTTCTCGCTGGTGGCTAGCGTGGGCGCCTGGAC 
GGGGTCCATGGGCAACTGGTCCATGTTCACCTGGTGCTTCTGCTTCTCCGTGACCCTGATCA 
TCCTCATCGTGGAGCTGTGCGGGCTCCAGGCCCGCTTCCCCCTGTCTTGGCGCAACTTCCCC 
ATCACCTTCGCCTGCTATGCGGCCCTCTTCTGCCTCTCGGCCTCCATCATCTACCCCACCAC 
CTATGTCCAGTTCCTGTCCCACGGCCGTTCGCGGGACCACGCCATCGCCGCCACCTTCTTCT 
CCTGCATCGCGTGTGTGGCTTACGCCACCGAAGTGGCCTGGACCCGGGCCCGGCCCGGCGAG 
ATCACTGGCTATATGGCCACCGTACCCGGGCTGCTGAAGGTGCTGGAGACCTTCGTTGCCTG 
CATCATCTTCGCGTTCATCAGCGACCCCAACCTGTACCAGCACCAGCCGGCCCTGGAGTGGT 
GCGTGGCGGTGTACGCCATCTGCTTCATCCTAGCGGCCATCGCCATCCTGCTGAACCTGGGG 
GAGTGCACCAACGTGCTACCCATCCCCTTCCCCAGCTTCCTGTCGGGGCTGGCCTTGCTGTC 
TGTCCTCCTCTATGCCACCGCCCTTGTTCTCTGGCCCCTCTACCAGTTCGATGAGAAGTATG 
GCGGCCAGCCTCGGCGCTCGAGAGATGTAAGCTGCAGCCGCAGCCATGCCTACTACGTGTGT 
GCCTGGGACCGCCGACTGGCTGTGGCCATCCTGACGGCCATCAACCTACTGGCGTATGTGGC 
TGACCTGGTGCACTCTGCCCACCTGGTTTTTGTCAAGGTCTAAGACTCTCCCAAGAGGCTCC 
CGTTCCCTCTCCAACCTCTTTGTTCTTCTTGCCCGAGTTTTCTTTATGGAGTACTTCTTTCC 

CAATTCCTTGCACTCTAACCAGTTCTTGGATGCATCTTCTTCCTTCCCTTTCCTCTTGCTGT 
TTCCTTCCTGTGTTGTTTTGTTGCCCACATCCTGTTTTCACCCCTGAGCTGTTTCTCTTTTT 
CTTTTCTTTCTTTTTTTTTTTTTTTTTTAAGACGGATTCTCACTCTGTGGCCCAGGCTGGAG 
TGCAGTGGTGCGATCTCAGCTCACTGCAACCCCCGCCTCCTGGGTTCAAGCGATTCTCCTCC 
CCCAGCCTCCCAAGTAGCTGGGAGGACAGGTGTGAGCTGCCGCACCCAGCCTGTTTCTCTTT 
TTCCACTCTTCTTTTTTCTCATCTCTTTTCTGGGTTGCCTGTCGGCTTTCTTATCTGCCTGT 
TTTGCAAGCACCTTCTCCTGTGTCCTTGGGAGCCCTGAGACTTCTTTCTCTCCTTGCCTCCA 
CCCACCTCCAAAGGTGCTGAGCTCACATCCACACCCCTTGCAGCCGTCCATGCCACAGCCCC 
CCAAGGGGCCCCATTGCCAAAGCATGCCTGCCCACCCTCGCTGTGCCTTAGTCAGTGTGTAC 
GTGTGTGTGTGTGTGTGTTTGGGGGGTGGGGGGTGGGTAGCTGGGGATTGGGCCCTCTTTCT 
CC CAGTGG AGGAAGGTGTG CAGTGTACTTCCC CTTT AAATTAAAAAACAT AT AT ATATAT AT 
ATTTGGAGGTCAGTAATTTCCAATGGGCGGGAGGCATTAAGCACCGACCCTGGGTCCCTAGG 
CCCCGCCTGGCACTCAGCCTTGCCAGAGATTGGCT CCAGAATTTTTG CCAGGCTTACAGAACAC 
CCACTGCCTAGAGGCCATCTTAAAGGAAGCAGGGGCTGGATGCCTTTCATCCCAACTATTCT 
CTGTGGTATGAAAAAG 
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FIGURE 20 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA58727 
<subunit 1 of 1, 322 aa, 1 stop 
<MW: 35274, pi: 8.57, NX(S/T): 1 

MPVTVTRTTITTTTTSSSGLGSPMIVGSPRALTQPLGLLRLLQLVSTCVAFSLVASVGAWTG 
SMGNWSMFTWCFCFSVTLIILIVELCGLQARFPLSWRNFPITFACYAALFCLSASIIYPTTY 
VQFLSHGRSRDHAIAATFFSCIACVAYATEVAWTRARPGEITGYMATVPGLLKVLETFVACI 
IFAFISDPNLYQHQPALEWCVAVYAICFILAAIAILLNLGE 

LL YATALiVLWPL YQ F DE KYGGQ PRRS RD VS C SR SHAYYVCAWDRRLAVA I LT A I NLL AYVAD 
LVHSAHLVFVKV 

Important features : 
Transmembrane domains : 

amino acids 41-60 (type II), 66-85, 101-120, 137-153, 171-192, 
205-226, 235-255 and 294-312 

N-glycosylation site. 

amino acids 66-69 

Glycosaminoglycan attachment site. 

amino acids 18-21 
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FIGURE 21 



GAACGTGCCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTC 

TTGAACTCGTGACCTCATGATCCGCTCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGACGC 

CTGGCCAGCCTATGCATTTTTAAGAAATTATTCTGTATTAGGTGCTGTGCTAAACATTGGGCACTACAGTGACCA 

AAACAGACTGAATTCCCCAAGAGCCAAAGACCAGTGAGGGAGACCAACAAGAAACAGGAAATGCAAAAGAGACCA 

TTATTACTCACTATGACTAAGGGTCACAAATGGGGTACGTTGATGGAGAGTGATTTGTTAAGAGACTACAGAGGG 

AGGACAGACTACCAAGAGGGGGGCCAGGAAAGCTCCTCTGACGAGGTGGTATTTCAGCCCAAACTGGAAGAATGA 

GAAAGAGCTAGCCAGCCATCAGAATAGTCCAGAAGAGATGGGGAGCACTACACTCACTACACTTTGGCCTGAGAA 

AATAGCATGGGATTGGAGGAGGCTGGGGGAACACCACTTCTGCCGACCTGGGCAGGAGGCATTGAGGGCTTGAGA 

AAGGGCAATGGCAGTAGCAGTAGAAAGGACAGGGTAGGAGCAGGGACTTTGCAGGTGGAATCATTAGGTCTTATC 

AACAGATATGGGCAAGCAAAGCCAGGGGAGAATTGATGGTAATGCTGAGGTTTGGAGCCAGGCTAGATGGGACAG 

TGGTGGGTGATGCAAAGGAAAGAGGTCAGGAAGCAGGGCCAGACGTGGGGAGAAGGTGTGGGGGTTTGGTTTCCA 

TCTTGCCGAGTCTGCCGGAATGTGGATGGGAAGACCAAGAGGAGGAGCAAGGGGCAGAGGGGAAGGGAATCTT71A 

AGAAGTCCTGGATGCCACACTCTTCTTCCTTCCTCCTCTTCCCTCTCCTCAGAGGTCTCACTCGTGGTTCTTCAT 

TTCCTGCCCTGCCTCCATCTCCTCTGGGTGCTGGGAAAGTGGAGGATTAGCTGAAGTTTTGCTTCTCGGGGCCTG 

TCTGAATCTCCATTGCTTTCTGGGAGGACATAATTC^CCTGTCCTAGCTTCTTATCATCTTACATTTCCCTGTAG 

CCACTGGGACATATGTGGTGTTCCTTCCTAGCTCCTGTCTCCTCCTCATGCCTTTGCTGGGTATGGGCATGTTAG 

GGGGAAGGTCATTGCTGTCAGAGGGGCACTGACTTTCTAATGGTGTTACCCAAGGTGAATGTTGGJ\GACACAGTC 

GCGATGCTGCCCAAGTCCCGGCGAGCCCTAACTATCCAGGAGATCGCTGCGCTGGCCAGGTCCTCCCTGCATGGT 

ATGCAGCCCCTCCCATGTTTCTGGCCACTTTGTCCTTTCTCCTCCCGTTTGCACATCCCTTTGGAACTGTTTCCT 

GTGAGTACATGCTGGGGTCTCCCCTTTCTTCCCTTGCTCAGGTGAATCTCAGCCCCTTCTCCCACCCAAAGGTTC 

ACATGGATCCTAACTACTGCCACCCTTCCACCTCCCTGCACCTGTGCTCCCTGGCCTGGTCCTTTACCAGGCTTC 

TCCACCCTCCCCTATCTCCAGGTATTTCCCAGGTGGTGAAGGACCACGTGACCAAGCCTACCGCCATGGCCCAGG 

GCCGAGTGGCTCACCTCATTGAGTGGAAGGGCTGGAGCAAGCCGAGTGACTCACCTGCTGCCCTGGAATCAGCCT 

TTTC CTCCTATTC AGACCTC AGCGAGGGCGAAC AAG AGGCTCG CTTTGCAGC AGG AGTGG CTGAGCAGTTT GC C A 

TCGCGGAAGCCAAGCTCCGAGCATGGTCTTCGGTGGATGGCGAGGACTCCACTGATGACTCCTATGATGAGGACT 

TTGCTGGGGGAATGGACACAGACATGGCTGGGCAGCTGCCCCTGGGGCCGCACCTCCAGGACCTGTTCACCGGCC 

ACCGGTTCTCCCGGCCTGTGCGCCAGGGCTCCGTGGAGCCTGAGAGCGACTGCTCACAGACCGTGTCCCCAGACA 

CCCTGTGCTCTAGTCTGTGCAGCCTGGAGGATGGGTTGTTGGGCTCCCCGGCCCGGCTGGCCTCCCAGCTGCTGG 

GCGATGAGCTGCTTCTCGCCAAACTGCCCCCCAGCCGGGAAAGTGCCTTCCGCAGCCTGGGCCCACTGGAGGCCC 

AGGACTCACTCTACAACTCGCCCCTCACAGAGTCCTGCCTTTCCCCCGCGGAGGAGGAGCCAGCCCCCTGCAAGG 

ACTGCCAGCCACTCTGCCCACCACTAACGGGCAGCTGGGAACGGCAGCGGCAAGCCTCTGACCTGGCCTCTTCTG 

GGGTGGTGTCCTTAGATGAGGATGAGGCAGAGCCAGAGGAACA GTGA CCCACATCATGCCTGGCAGTGGCATGCA 

TCCCCCGGCTGCTGCCAGGGGCAGAGCCTCTGTGCCCAAGTGTGGGCTCAAGGCTCCCAGCAGAGCTCCACAGCC 

TAGAGGGCTCCTGGGAGCGCTCGCTTCTCCGTTGTGTGTTTTGCATGAAAGTGTTTGGAGAGGAGGC71GGGGCTG 

GGCTGGGGGCGCATGTCCTGCCCCCACTCCCGGGGCTTGCCGGGGGTTGCCCGGGGCCTCTGGGGCATGGCTACA 

GCTGTGGCAGACAGTGATGTTCATGTTCTTAAAATGCCACACACACATTTCCTCCTCGGATAATGTGAACCACTA 

AGGGGGTTGTGACTGGGCTGTGTGAGGGTGGGGTGGGAGGGGGCCCAGCAACCCCCCACCCTCCCCATGCCTCTC 

TCTTCTCTGCTTTTCTTCTCACTTCCGAGTCCATGTGCAGTGCTTGATAGAATCACCCCCACCTGGAGGGGCTGG 

CTCCTGCCCTCCCGGAGCCTATGGGTTGAGCCGTCCCTCAAGGGCCCCTGCCCAGCTGGGCTCGTGCTGTGCTTC 

ATTC^CCTCTCCATCGTCTCTAAATCITCCTCTTTTTTCCTAAAGACAGAAGGTTTTTGGTCTGTTTTTTCAGTC 

GGATCTTCTCTTCTCTGGGAGGCTTTGGAATGATGAAAGCATGTACCCTCCACCCTTTTCCTGGCCCCCTAATGG 

GGCCTGGGCCCTTTCCCAACCCCTCCTAGGATGTGCGGGCAGTGTGCTGGCGCCTCACAGCCAGCCGGGCTGCCC 

ATTCACGCAGAGCTCTCTGAGCGGGAGGTGGAAGAAAGGATGGCTCTGGTTGCCACAGAGCTGGGACTTCATGTT 

CTTCTAGAGAGGGCCACAAGAGGGCCACAGGGGTGGCCGGGAGTTGTCAGCTGATGCCTGCTGAGAGGCAGGAAT 

TGTGCCAGTGAGTGACAGTCATGAGGGAGTGTCTCTTCTTGGGGAGGAAAGAAGGTAGAGCCTTTCTGTCTGAAT 

GAAAGGCCAAGGCTACAGTACAGGGCCCCGCCCCAGCCAGGGTGTTAATGCCCACGTAGTGGAGGCCTCTGGCAG 

ATCCTGCATTCCAAGGTCACTGGACTGTACGTTTTTATGGTTGTGGGAAGGGTGGGTGGCTTTAGAATTAAGGG 

CTTGTAGGCTTTGGCAGGTAAGAGGGCCCAAGGTAAGAACGAGAGCCAACGGGCACAAGCATTCTATATATAAGT 

GGCTCATTAGGTGTTTATTTTGTTCTATTTAAGAATTTGTTTTATTAAATTAATATAAAAATCTTTGTAAATCTC 

TAAAA 
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FIGURE 22 



MFLATLSFLLPFAHPFGTVSCEYMLGSPLSSLAQVNLSPFSHPKVHMDPNYCHPSTSLHLCS 
LAWSFTRLLHPPLSPGISQWKDHVTKPTAMAQGRVAHLIEWKGWSKPSDSPAALESAFSSY 
SDLSEGEQEARFAAGVAEQFAIAEAKLRAWSSVDGEDSTDDSYDEDFAGGMDTDMAGQLPLG 
PHLQDLFTGHRFSRPVRQGSVEPESDCSQTVSPDTLCSSLCSLEDGLLGSPARLASQLLGDE 
LLLAKLPPSRESAFRSLiGPLEAQDSLYNSPLTESCLSPAEEEPAPCKDCQPLCPPLTGSWER 
QRQASDLASSGWSLDEDEAEPEEQ 

Signal peptide: 

amino acids 1-15 

Casein kinase II phosphorylation site. 

amino acids 123-127, 128-132, 155-159, 162-166, 166-170, 228-232, 
285-289, 324-328 

Tyrosine kinase phosphorylation site. 

amino acids 44-52 

N-myristoylation site . 

amino acids 17-23, 26-32, 173-179 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 11-22 
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FIGURE 23 



GGTTCCTGGGCGCTCTGTTACACAAGCAAGATACAGCCAGCCCCACCTAATTTTGTTTCCCT 
GGCACCCTCCTGCTCAGTGCGACATTGTCACACTTAACCCATCTGTTTTCTCTAATGCACGA 
CAGATTCCTTTCAGACAGGACAACTGTGATATTTCAGTTCCTGATTGTAAATACCTCCTAAG 
CCTGAAGCTTCTGTTACTAGCCATTGTGAGCTTCAGTTTCTTCATCTGCAAAATGGGCATAA 
TACAATCTATTCTTGCCACATCAAGGGATTGTTATTCCTTTAAAAAAAAACCAATACCAAAG 
AAGCCTACAATGTTGGCCTTAGCCAAAATTCTGTTGATTTCAACGTTGTTTTATTCACTTCT 
ATCGGGG AGC C ATGGAAAAGAAAATCAAG ACATAAACACAAC ACAGAAC ATTGCAGAAGTT T 
TTAAAACAATGGAAAAT AAAC CT ATTT GTTTGG AAAGTG AAGC AAAC TT AAACTCAGATAAA 
GAAAATATAACCACCTCAAATCTCAAGGCGAGTCATTCCCCTCCTTTGAATCTACCCAACAA 
CAGCCACGGAATAACAGATTTCTC CAGTAACTCATC AG CAGAGCATTCTTTGGGCAGTCTAA 
AACCCACATCTACCATTTCCACAAGCCCTCCCTTGATCCATAGCTTTGTTTCTAAAGTGCCT 
TGG AATGCAC CT ATAGC AG ATGAAG AT CT TTTGC C CAT CTC AG CACATC CC AATG CTAC AC C 
TGCTCTGTCTTCAGAAAACTTCACTTGGTCTTTGGTCAATGACACCGTGAAAACTCCTGATA 
ACAGTTCCATTACAGTTAGCATCCTCTCTTCAGAACCAACTTCTCCATCTGTGACCCCCTTG 
AT AGTGG AAC CAAGTGGATGG CTTAC CACAAACAGTGATAGCTT C AC TGGGTTTAC C C CTT A 
TC AAGAAAAAACAACTCTACAGC CT AC CTTAAAATT C AC CAATAATT CAAAACT CTTTC C AA 
ATACGT CAGATC CC CAAAAAGAAAATAGAAATACAGGAATAGTATT CGGGGCCATTTTAGGT 
GCTATTCTGGGTGTCTCATTGCTTACTCTTGTGGGCTACTTGTTGTGTGGAAAAAGGAAAAC 
GGATTCATTTTCCCATCGGCGACTTTATGACGACAGAAATGAACCAGTTCTGCGATTAGACA 
ATGCACCGGAACCTTATGATGTGAGTTTTGGGAATTCTAGCTACTACAATCCAACTTTGAAT 
GATTCAGCCATGCCAGAAAGTGAAGAAAATGCACGTGATGGCATTCCTATGGATGACATACC 
T C CACTTCGTACTTCTGTATAGAACTAACAGC AAAAAGG CGTT AAACAGCAAGTGT CATCTA 
CATCCTAGCCTTTTGACAAATTCATCTTTCAAAAGGTTACACAAAATTACTGTCACGTGGAT 
TTTGTC AAGGAGAATCATAAAAG CAGGAG AC CAGTAGCAGAAATGTAGACAGGATGT AT CAT 
CCAAAGGTTTTCTTTCTTACAATTTTTGGCCATCCTGAGGCATTTACTAAGTAGCCTTAATT 
TGTATTTTAGTAGTATTTTCTTAGTAGAAAATATTTGTGGAATCAGATAAAACTAAAAGATT 
TCAC CATT AC AGCC CTGC CT CAT AACT AAATAAT AAAAATTATTCCACC AAAAAATT CTAAA 
AC AATG AAGATGACTCTTTACTG CTCTGC CTGAAG CC CTAGTACC AT AATT CAAGATTG CAT 
TT TCTT AAATGAAAATTG AAAGGGTG CT TTTT AAAG AAAATTTGACTTAAAGCTAAAAAG AG 
GACATAGC CCAGAGTTTCTGTT ATTGGGAAATTGAGGC AATAGAAATG AC AGAC CTGT ATTC 
TAGTACGTTATAATTTTCTAGATCAGCACACACATGATCAGCCCACTGAGTTATGAAGCTGA 
CAATGACTGCATTCAACGGGGCCATGGCAGGAAAGCTGACCCTACCCAGGAAAGTAATAGCT 
TCTTTAAAAGTCTTCAAAGGTTTTGGGAATTTTAACTTGTCTTAATATATCTTAGGCTTCAA 
TTATTTGGGTGCCTTAAAAACTCAATGAGAATCATGGT 
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FIGURE 24 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58732 
xsubunit 1 of 1, 334 aa, 1 stop 
><MW: 36294, pi: 4.98, NX(S/T): 13 

MLALAKILLISTLFYSLLSGSHGKENQDINTTQNIAEVFKTMENKPISLESEANLNSDKENI 
TTSNLKASHSPPLNLPNNSHGITDFSSNSSAEHSLGSLKPTSTISTSPPLIHSFVSKVPWNA 
PIADEDLLPISAHPNATPALSSENFTWSLV1TOTVKTPDNSSIWSILSSEPTSPSVTPLIVE 
PSGWLTTNSDSFTGFTPYQEKTTLQPTLKFTNNSKLFPNTSDPQKENRNTGIVFGAILGAIL 
GVSLLTLVGYLLCGKRKTDSFSHRRLYDDRNEPVLRLDNAPEPYDVSFGNSSYYNPTLNDSA 
MPESEENARDG I PMDD I PPLRTSV 

Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 235-2 62 

N-glycosylation site. 

amino acids 30-34, 61-65, 79-83, 90-94, 148-152, 155-159, 
163-167, 218-222, 225-229, 298-302, 307-311 
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FIGURE 25 

AACAGGATCTCCTCTTGCAGTCTGCAGCCCAGGACGCTGATTCCAGCAGCGCCTTACCGCGC 
AG CC CG AAGATTCACTATGGTGAAAATCG C CT T C AATA C C C CTACCG C CGTG CAAAAGGAGG 
AGGCGCGGCAAGACGTGGAGGCCCTCCTGAGCCGCACGGTCAGAACTCAGATACTGACCGGC 
AAGGAGCTCCGAGTTGCCACCCAGGAAAAAGAGGGCTCCTCTGGGAGATGTATGCTTACTCT 
CTTAGGCCTTTCATTCATCTTGGCAGGACTTATTGTTGGTGGAGCCTGCATTTACAAGTACT 
TCATGCCCAAGAGCACCATTTACCGTGGAGAGATGTGCTTTTTTGAT-TCTGAGGATCCTGCA 
AATTCCCTTCGTGGAGGAGAGCCTAACTTCCTGCCTGTGACTGAGGAGGCTGACATTCGTGA 
GG ATGACAACAT TG C AAT CATTGATGTG C CTGT C CC CAGTTTCTCTGATAGTGAC CCTGC AG 
CAATTATTCATGACTTTGAAAAGGGAATGACTGCTTACCTGGACTTGTTGCTGGGGAACTGC 
TATCTGATGCCCCTCAATACTTCTATTGTTATGCCTCCAAAAAATCTGGTAGAGCTCTTTGG 
CAAACTGGCGAGTGGCAGATATCTGCCTCAAACTTATGTGGTTCGAGAAGACCTAGTTGCTG 
TGGAGGAAATTCGTGATGTTAGTAACCTTGGCATCTTTATTTACCAACTTTGCAATAACAGA 
AAGTCCTTCCGCCTTCGTCGCAGAGACCTCTTGCTGGGTTTCAACAAACGTGCCATTGATAA 
ATGCTGGAAGATT AGACACTT CC C CAACGAATTT ATTGTTGAGACC AAGATCTG TCAAGAGT 
AAGAGGCAACAGATAGAGTGTCCTTGGTAATAAGAAGTCAGAGATTTACAATATGACTTTAA 
CATT AAGGTTTATGGGATACT CAAGATATTTACT CATG CATTTACT CT ATTG CT TATG CTTT 
AAAAAAAGGAAAAAAAAAAAAACTACTAACCACTGCAAGCTCTTGTCAAATTTTAGTTTAAT 
TGGCATTGCTTGTTTTTTGAAACTGAAATTACATGAGTTTCATTTTTTCTTTGCATTTATAG 
GGTTT AG ATTT CTGAAAG CAG C ATGAAT ATAT CACCTAACAT C C TG AC AATAAATT C CATCC 
GTTGTTTTTTTTGTTTGTTTGTTTTTTCTTTTCCTTTAAGTAAGCTCTTTATTCATCTTATG 
GTGGAGCT^TTTTAAAATTTGAAATATTTTAAATTGTTTTTGAACTTTTTGTGTAAAATATA 
TCAGATCTCAACATTGTTGGTTTCTTTTGTTTTTCATTTTGTACAACTTTCTTGAATTTAGA 
AATTACATCTTTGCAGTTCTGTTAGGTGCTCTGTAATTAACCTGACTTATATGTGAACAATT 
TTCATGAGACAGTCATTTTTAACTAATGCAGTGATTCTTTCTCACTACTATCTGTATTGTGG 
AATGCACAAAATTGTGTAGGTGCTGAATGCTGTAAGGAGTTTAGGTTGTATGAATTCTACAA 
CCCTATAATAAATTTTACTCTATACAAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA58828 
<subunit 1 of l, 263 aa, 1 stop 



<MW: 29741, pi: 5.74, NX(S/T): 1 

MVKIAFNTPTAVQKEEARQDVEALLSRTVRTQILTGKELRVATQEKEGSSGRCMLTLLGLSF 
ILAGIjIVGGACIYKYFMPKSTI YRGEMCFFDSEDPANSLRGGEPNFLPVTEEADIREDDNIA 
IIDVPVPSFSDSDPAAIIHDFEKGMTAYLDLLLGNCYLMPLNTSIVMPPKNLVELFGKLASG 
RYLPQTYWREDLVAVEE IRDVSNLG I F I YQLCNNRKSFRLRRRDLLLGFNKRAIDKCWKI R 
HFPNEFIVETKICQE 

Type II transmembrane domain: 

amino acids 53-75 

N-glycosylation site. 

amino acids 166-170 

Casein kinase II phosphorylation site. 

amino acids 35-39, 132-136, 134-138 

N-myristoylation site. 

amino acids 66-72, 103-109 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 63-74 
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FIGURE 27 



GGAGGAGGGAGGGCGGGCAGGCGCCAGCCCAGAGCAGCCCCGGGCACCAGCACGGACTCTCT 
CTTCCAGCCCAGGTGCCCCCCACTCTCGCTCCATTCGGCGGGAGCACCCAGTCCTGTACGCC 
AAGGAACTGGTCCTGGGGGCACCATGGTTTCGGCGGCAGCCCCCAGCCTCCTCATCCTTCTG 
TTGCTGCTGCTGGGGTCTGTGCCTGCTACCGACGCCCGCTCTGTGCCCCTGAAGGCCACGTT 
CCTGGAGGATGTGGCGGGTAGTGGGGAGGCCGAGGGCTCGTCGGCCTCCTCCCCGAGCCTCC 
CGCCACCCTGGACCCCGGCCCTCAGCCCCACATCGATGGGGCCCCAaCCCACAACCCTGGGG 
GGCCCATCACCCCCCACCAACTTCCTGGATGGGATAGTGGACTTCTTCCGCCAGTACGTGAT 
GCTGATTGCTGTGGTGGGCTCCCTGGCCTTTCTGCTGATGTTCATCGTCTGTGCCGCGGTCA 
TCACCCGGCAGAAGCAGAAGGCCTCGGCCTATTACCCATCGTCCTTCCCCAAGAAGAAGTAC 
GTGGACCAGAGTGACCGGGCCGGGGGCCCCCGGGCCTTCAGTGAGGTCCCCGACAGAGCCCC 
CGACAGCAGGCCCGAGGAAGCCCTGGATTCCTCCCGGCAGCTCCAGGCCGACATCTTGGCCG 
CCACCCAGAACCTCAAGTCCCCCACCAGGGCTGCACTGGGCGGTGGGGACGGAGCCAGGATG 
GTGGAGGGCAGGGGCGCAGAGGAAGAGGAGAAGGGCAGCCAGGAGGGGGACCAGGAAGTCCA 
GGGACATGGGGTCCCAGTGGAGACACCAGAGGCG CAGGAGGAGCCGTG CTCAGGGGTCCTTG 
AGGGGGCTGTGGTGGCCGGTGAGGGCCAAGGGGAGCTGGAAGGGTCTCTCTTGTTAGCCCAG 
GAAGCCCAGGGACCAGTGGGTCCCCCCGAAAGCCCCTGTGCTTGCAGCAGTGTCCACCCCAG 
TGTCTAACAGTCCTCCCGGGCTGCCAGCCCTGACTGTCGGGCCCCCAAGTGGTCACCTCCCC 
GTGTATGAAAAGGCCTTCAGCCCTGACTGCTTCCTGACACTGCCTCCTTGGCCTCCCTGTGG 
TGCCAATCCCAGCATGTGCTGATTCTACAGCAGGCAGAAATGCTGGTCCCCGGTGCCCCGGA 
GG AAT CTT AC CAAGTG CC AT CAT CCTTC ACCT CAGC AG CCCCAAAGGGCT ACAT CCTACAG C 
AC AG CTCC C CTGACAAAGTGAGGGAGGGCACGTGTCCCTGTGACAGCCAGGATAAAAC AT CC 
CCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCCGGCCCAAACTACTTTTTAAAACA 
GCTACAGGGTAAAATCCTGCAGCACCCACTCTGGAAAATACTGCTCTTAATTTTCCTGAAGG 
TGGCCCCCTGTTTCTAGTTGGTCCAGGATTAGGGATGTGGGGTATAGGGCATTTAAATCCTC 
TCAAGCGCTCTCCAAGCACCCCCGGCCTGGGGGTGAGTTTCTCATCCCGCTACTGCTGCTGG 
GATCAGGTTGAATGAATGGAACTCTTCCTGTCTGGCCTCCAAAGCAGCCTAGAAGCTGAGGG 
GCTGTGTTTGAGGGGACCTCCACCCTGGGGAAGTCCGAGGGGCTGGGGAAGGGTTTCTGACG 
CCCAGCCTGGAGCAGGGGGGCCCTGGCCACCCCCTGTTGCTCACACATTGTCTGGCAGCCTG 
TGTCCACAATATTCGTCAGTCCTCGACAGGGAGCCTGGGCTCCGTCCTGCTTTAGGGAGGCT 
CTGGCAGGAGGTCCTCTCCCCCATCCCTCCATCTGGGGCTCCCCCAACCTCTGCACAGCTCT 
CCAGGTGCTGAGATATAATGCACCAGCACAATAAACCTTTATTCCGGCCTGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA 
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FIGURE 28 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA58852 
xsubunit 1 of 1, 283 aa f 1 stop 
><MW: 29191, pi: 4^52, NX(S/T): 0 

MVSAAAPSLLILLLLLLGSVPATDARSVPLKATFLEDVAGSGEAEGSSASSPSLPPPWTPAL 
SPTSMGPQPTTLGGPSPPTNFLDGIVDFFRQYVMLIAWGSLAFLLMFIVCAAVITRQKQKA 
SAYYPSSFPKKKYVDQSDRAGGPRAFSEVPDRAPDSRPEEALDSSRQLQADILAATQNLKSP 
TRAALGGGDGARMVEGRGAEEEEKGSQEGDQEVQGHGVPVETPEAQEEPCSGVLEGAWAGE 
GQGE LEG SLLLAQE AQG P VG P PE S P C ACS S VHP S V 

Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 94-118 

N-myris toylation site . 

amino acids 18-24, 40-46, 46-52, 145-151, 192-198, 193-199, 
211-217, 238-244, 242-248 
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FIGURE 29 



GTGGACTCTGAGAAG CC CAGG CAGTTGAGG AC AGGAG AG AG AAGG CTGCAGACCCAG AGG GA 
GGGAGGACAGGGAGTCGGAAGGAG GAGG AC AGAGGAGGG C ACAG AGACGC AG AG CAAGGG CG 
GC AAGGAGGAGACCCTGGTGGGAGGAAGACACT CTG GAGAGAGAGGGGGCTGGG CAGAG ATG 
AAGTTCCAGGGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGG 
CCCCCTGCAGAGCGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCC 
TGGGAGACGCCCTGAGCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCT 
GGCTCTAAAGTCAGTGAGGCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAG 
GCAGGTTCCAGGCTTTGGCGCAGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATG 
CT CTGGG AAACACTGGGCAC GAGATTGG CAGACAGGCAGAAGATGTC ATTCGAC ACGGAG CA 
GATGCTGTCCGCGGCTCCTGGCAGGGGGTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGG 
AGGCCATGGCATCTTTGGCTCTCAAGGTGGCCTTGGAGGCCAGGGCCAGGGCAATCCTGGAG 

gtctggggactccgtgggtccacggataccccggaaactcagcaggcagctttggaatgaat 

cctcagggagctccctggggtcaaggaggcaatggagggccaccaaactttgggaccaacac 

tc agggagctgtggcccagc ctgg ctatggtt cagtg agagc cagc aaccag aatgaagggt 

gcacgaatcccccaccatctggctcaggtggaggctccagcaactctgggggaggcagcggc 

tcacagtcgggcagcagtggcagtggcagcaatggtgacaacaacaatggcagcagcagtgg 

tggcagcagcagtggcagcagcagtggcagcagcagtggcggcagcagtggcggcagcagtg 

gtgg cagcagtgg caacagtggtggc ag cagaggtg acag cggcagtgagt cctcctgggga 

tccagcaccggctcctcctccggcaaccacggtgggagcggcggaggaaatggacataaacc 

cgggtgtgaaaagc cagggaatgaagcccgcgggagcggggaatctgggatt cagggctt ca 

gaggac aggg agtttc cagc aacatgagggaaataagc aaagagggc aatcg cc tccttgga 

ggctctggagacaattatcgggggcaagggtcgagctggggcagtggaggaggtgacgctgt 

tggtggagtcaatactgtgaactctgagacgt ct cctgggatgtttaactttgacactttct 

ggaagaattttaaatccaagctgggtttcatcaactgggatgccataaacaaggaccagaga 

agctctcgcatcccgt^cctccagacaaggagccaccagattggatgggagcccccacact 

ccctccttaaaacaccaccctctcatcactaatctcagcccttgcccttgaaataaacctta 

gctgccccacaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa^ 
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FIGURE 30 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss . DNA59212 
xsubunit 1 of 1, 440 aa, 1 stop 
><MW: 42208, pi: 6.36, NX(S/T): 1 

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGA 
AGSKVSEALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHG 
ADAVRGSWQGVPGHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGM 
NPQGAPWGQGGNGGPPNFGTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGS 
GSQSGSSGSGSNGDNNNGSSSGGSSSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSW 
GSSTGS SSGNHGGSGGGNGHKPGCEKPGNEARGSGESGI QGFRGQGVSSNMRE I SKEGNRLL 
GGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGMFNFDTFWKNFKSKLGFINWDAINKDQ 
RSSRIP 

Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 2 6 5-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 
Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 

74-80, 90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 

155-161, 159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 

218-224, 236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 

249-252, 253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 

279-285, 283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 

295-301, 298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 

323-329, 325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 

383-389, 387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 




WO 00/12708 



PCT/US99/20111 



FIGURE 31 



GACCGGTCCCTCCGGTCCTGGATGTGCGGACTCTGCTGCAGCGAGGGCTGCAGGCCCGCCGGGCGGTGCTCACCG 
TGCCCTGGCTGGTGGAGTTTCTCTCCTTTGCTGACCATGTTGTTCCCTTGCTGGAATATTACCGGGACATCTTCA 
CTCTCCTGCTGCGCCTGCACCGGAGCTTGGTGTTGTCGCAGGAGAGTGAGGGGAAGATGTGTTTCCTGAACAAGC 
TGCTGCTACTTGCTGTCCTGGGCTGGCTTTTCCAGATTCCCACAGTCCCTGAGGACTTGTTCTTTCTGGAAGAGG 
GTCCCTCATATGCCTTTGAGGTGGACACAGTAGCCCCAGAGCATGGCTTGGACAATGCGCCTGTGGTGGACCAGC 
AGCTGCTCTACACCTGCTGCCCCTACATCGGAGAGCTCCGGAAACTGCTCGCTTCGTGGGTGTCAGGCAGTAGTG 
GACGGAGTGGGGGCTTCATGAGGAAAATCACCCCCACCACTACCACCAGCCTGGGAGCCCAGCCTTCCCAGACCA 
GCCAGGGGCTGCAGGCACAGCTCGCCCAGGCCTTTTTCCACAACCAGCCGCCCTCCTTGCGCCGGACCGTAGAGT 
TCGTGGCAGAAAGAATTGGATCAAACTGTGTCAAACATATCAAGGCTACACTGGTGGCAGATCTGGTGCGCCAGG 
CAGAGTCACTTCTCCAAGAGCAGCTGGTGACACAGGGAGAGGAAGGGGGAGACCCAGCCCAGCTGTTGGAGATCT 
TGTGTTCCCAGCTGTGCCCTCACGGGGCCCAGGCATTGGCCCTGGGGCGGGAGTTCTGTCAAAGGAAGAGCCCTG 
GGGCTGTGCGGGCGCTGCTTCCAGAGGAGACCCCGGCAGCCGTTCTGAGCAGTGCAGAGAACATTGCTGTGGGGC 
TTGCAACAGAGAAAGCCTGTGCTTGGCTGTCAGCCAACATCACAGCACTGATCAGGAGGGAGGTGAAAGCAGCAG 
TGAGTCGCACACTTCGAGCCCAGGGTCCTGAACCTGCTGCCCGGGGGGAGCGGAGGGGCTGCTCCCGCGCC TGA C 
GTGCTCTCCTTGGCCGTGGGGCCACGGGACCCTGACGAGGGAGTCTCCCCAGAGCATCTGGAACAGCTCCTAGGC 
CAGCTGGGCCAGACGCTGCGGTGCCGCCAGTTCCTGTGCCCACCTGCTGAGCAGCATCTGGCAAAGTGCTCTGTG 
GAGTTAGCTTCCCTCCTCGTTGCAGATCAAATTCCTATCCTAGGGCCCCCGGCACAGTACAGGCTGGAGAGAGGG 
CAGGCTCGAAGGCTTCTGCACATGCTGCTTTCCTTGTGGAAGGAAGACTTTCAGGGGCCGGTTCCGCTGCAGCTG 
CTGCTGAGCCCAAGAAATGTGGGGCTTCTGGC^GACACAAGGCCAAGGGAGTGGGACTTGCTGCTATTCTTGCTA 
CGGGAGCTGGTGGAGAAGGGTCTGATGGGACGGATGGAGATAGAGGCCTGCCTGGGCAGCCTCCACCAGGCCCAG 
TGGCC^GGGGACTTTGCTGAAGAATTAGCAACACTGTCTAATCTGTTTCTAGCCGAGCCCCACCTGCCAGAACCC 
CAGCTAAGAGCCTGTGAGTTGGTGCAGCCAAACCGGGGCACTGTGCTGGCCCAGAGCTAGGGCTGAGAAGTGGCC 
CTGCCTTGGGCATTGCACCAGAACCCTGGACCCCCGCCTCACGAGGAGGCCCAAGTGCCCAATGCAGACCCTCAC 
TGGTTGGGGTGTAGCTGGGTCTACAGTCAGACTTCCTGCTCTAAGGGTGTCACTGCCTGGCATCCCACCACGCGA 
ATCCTAGAGGAAGGAGAGTTGGCCTGATTTGGGATTATGGCAGAAAAGTCCAGAGATGCCAGTCCTGGAGTAGAA 
GAGGTGGTGTTTGTTTATCTCTTGGATACTAAATGAAATGAGGTGTGTGGGCTTGTCAACACAGAATTCAAGCCT 
CATTTGCTATCCCAGCATCTCTTAAAACTTTGTAGTCTTGGAATTCATGACAGAGGCAAATGACTCCTGCTTAAC 
TTATGAAGAAAGTTAAAACATGAATCTTGGGAGTCTACATTTTCTTATCACC^VGGAGCTGGACTGCCATCTCCTT 
ATAAATGCCTAACACAGGCCGGGTCTGGTGGCTCATGCCTGTAATCCCAGCACTTTGAGAGGCCTGAGGTCGGCG 
GACTGCCTGAGGTCAGGAATTCAAGACCAGCCTGGCCAACATGGCAAAACC^ 

TTATTAGCTGGGCATGGTGGTGTGTGCCTGTAATCCCAGCTACTCAGGAGGATGAGGCAGGAGACCTGCTTGAAC 
CTGGAGGTGGAGGTTGCAGTGAGCCGAGGTCGCACCACTGCACTCCAGTCTGGGTAACAGAGCGAGACTTTCTAG 
AAAAAGCCTAACAAACAGATAAGGTAGGACTCAACCAACTGAAACCTGACTTTCCCCCTGTACCTTCAGCCCCTG 
TGCAGGTAGTAACCTCTTGAGACCTCTCCCTGACGAGGGACCAAGCACAGGGCATTTAGAGCTTTTTAGAATAAA 
CTGGTTTTCTTTAAAAAAAAAAAAAAAAAAAGGGCGGCCGCCCTTTTTTTTTTTTTTTTTTTTTTTTTT1TTTTT 
TTTTTTTTTTTTTTTTAAAAAGGGCTTTTATTAAAATTCTCCCCACACGATGGCT 

TGGGGCGTGTCCTGTAGGGAAAGGCCCTGTTTTCCCTGAGGCGGGGCTGGGCTTGTCCATGGGTCCGCGGAGCTG 
GCCGTGCTTGGCGCCCTGGCGTGTGTCTAGCTGCTTCTTGCCGGGCACAGAGCTGCGGGGTCTGGGGGCACCGGG 
AGCTAAGAGCAGGCTCTGGTGCAGGGGTGGAGGCCTGTCTCTTAACCGACACCCTGAGGTGCTCCTGAGATGCTG 
GGTCCACCCTGAGTGGCACGGGGAGCAGCTGTGGCCGGTGCTCCTTCYTAGGCCAGTCCTGGGGAAACTAAGCTC 
GGGCCCTTCTTTGCAAAGACCGAGGATGGGGTGGGTGTGGGGGACTCATGGGGAATGGCCTGAGGAGCTACGTGT 
GAAGAGGGCGCCGGTTTGTTGGCTGCAGCGGCCTGGAGCGCCTCTCTCCTGAGCCTCAGTTTCCCTTTCCGTCTA 
ATGAAGAACATGCCGTCTCGGTGTCTCAGGGCTATTAGGACTTGCCCTCAGGAAGTGGCCTTGGACGAGCGTCAT 
GTTATTTTCACAACTGTCCTGCGACGTTGGCCTGGGCACGTCATGGAATGGCCCATGTCCCTCTGCTGCGTGGAC 
GTCGCGGTCGGGAGTGCGCAGCCAGAGGCGGGGCCAGACGTGCGCCTGGGGGTGAGGGGAGGCGCCCCGGGAGGG 
CCTCACAGGAAGTTGGGCTCCCGCACCACCAGGCAGGGCGGGCTCCCGCCGCCGCCGCCGCCACCACCGTCCAGG 
GGCCGGTAGACAAAGTGGAAGTCGCGCTTGGGCTCGCTGCGCAGCAGGTAGCCCTTGATGCAGTGCGGCAGCGCG 
TCGTCCGCCAGCTGGAAGCAGCGCCCGTCCACCAGCACGAACAGCCGGTGCGCCT 




WO 00/12708 



PCT/US99/2011I 



FIGURE 3 2 



MCFLNKLLLLAVLGWLFQ I PTVPEDLFFLEEGPS YAFE VDTVAPEHGLDNAP WDQQLLYTC 
CPYIGELRKLLASWVSGSSGRSGGFMRKITPTTTTSLGAQPSQTSQGLQAQLAQAFFHNQPP 
SLRRTVEFVAERIGSNCVKHIKATLVADLVRQAESLLQEQLVTQGEEGGDPAQLLEILCSQL 
CPHGAQALALGREFCQRKSPGAVRALLPEETPAAVLSSAENIAVGLATEKACAWLSANITAL 
IRREVKAAVSRTLRAQGPEPAARGERRGCSRA 

Signal peptide: 

amino acids 1-18 

N-glycosylation site. 

amino acids 244-248 

CAMP- and cGMP- dependent protein kinase phosphorylation site, 

amino acids 89-93 

Casein kinase II phosphorylation site. 

amino acids 21-25, 167-171, 223-227 

N-myristoylation site. 

amino acids 100-106, 172-178, 207-213 

Microbodies C- terminal targeting signal. 

amino acids 278-282 
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FIGURE 33 



TCCCTTGACAGGTCTGGTGGCTGGTTCGGGGTCTACTGAAGGCTGTCTTGATCAGGAAACTG 

AAGACTCTCTGCTTTTGCCACAGCAGTTCCTGCAGCTTCCTTGAGGTGTGAACCCACATCCC 

TGCCCCCAGGGCCACCTGCAGGACGCCGACACCTACCCCTCAGCAGACGCCGGAGAGAAATG 

AGTAGCAACAAAGAGCAG CGGTC AGC AGTGT T CGTG AT CCT CTTTG CC CTC AT C AC CAT C CT 

CATCCTCTACAGCTCCAACAGTGCCAATGAGGTCTTCCATTACGGCTCCCTGCGGGGCCGTA 

GCCGCCGACCTGTCAACCTCAAGAAGTGGAGCATCACTGACGGCTATGTCCCCATTCTCGGC 

AACAAGACACTGCCCTCTCGGTGCCACCAGTGTGTGATTGTCAGCAGCTCCAGCCACCTGCT 

GGGCACC AAG CTGGG C CC TG AG AT CG AG CGGG CTGAGTGT AC AATCCGCATGAATGATGCAC 

CCACCACTGGCTACTCAGCTGATGTGGGCAACAAGACCACCTACCGCGTCGTGGCCCATTCC 

AGTGTGTTCCGCGTGCTGAGGAGGCCCCAGGAGTTTGTCAACCGGACCCCTGAAACCGTGTT 

CATCTTCTGGGGGCCCCCGAGCAAGATGCAGAAGCCCCAGGGCAGCCTCGTGCGTGTGATCC 

AGCGAGCGGGCCTGGTGTTCCCCAACATGGAAGCATATGCCGTCTCTCCCGGCCGCATGCGG 

CAATTTGACGAC CT CTT C CGGGGTGAGACGGGCAAGGACAGGGAGAAGTCT C AT TCGTGGTT 

GAGCACAGGCTGGTTTACCATGGTGATCGCGGTGGAGTTGTGTGACCACGTGCATGTCTATG 

GCATGGTCCCCCCCAACTACTGCAGCCAGCGGCCCCGCCTCCAGCGCATGCCCTACCACTAC 

T ACG AG C CCAAGGGGC CGGACGAATGTGT CACCT AC AT CC AG AATG AG CAC AGT CG CAAGGG 

CAACCACCACCGCTTCATCACCGAGAAAAGGGTCTTCTCATCGTGGGCCCAGCTGTATGGCA 

TCACCTTCTCCCACCCCTCCTGGACCTAGGCCACCCAGCCTGTGGGACCTCAGGAGGGTCAG 

AGGAGAAGCAGCCTCCGCCCAGCCGCTAGGCCAGGGACCATCTTCTGGCCAATCAAGGCTTG 

CTGGAGTGTCTCCCAGCCAATCAGGGCCTTGAGGAGGATGTATCCTCCAGCCAATCAGGGCC 

TGGGGAATCTGTTGGCGAATCAGGGATTTGGGAGTCTATGTGGTTAATCAGGGGTGTCTTTC 

TTGTGCAGTCAGGGTCTGCGCACAGTCAATCAGGGTAGAGGGGGTATTTCTGAGTCAATCTG 

AGGCTAAGGACATGTCCTTT CC CATG AGGC CTTGGTT C AGAG CC C C AGGAATGGAC CC CCCA 

ATCACTCCCCACTCTGCTGGGATAATGGGGTCCTGTCCCAAGGAGCTGGGAACTTGGTGTTG 

CCCCCTCAAT TT C C AG CACC AGAAAGAG AG ATTGTGTGGGGGTAGAAG CTGT CTGGAGGC C C 

GGCCAGAGAATTTGTGGGGTTGTGGAGGTTGTGGGGGCGGTGGGGAGGTCCCAGAGGTGGGA 

GGCTGGCATCCAGGTCTTGGCTCTGCCCTGAGACCTTGGACAAACCCTTCCCCCTCTCTGGG 

CACCCTTCTGCCCACACCAGTTTCCAGTGCGGAGTCTGAGACCCTTTCCACCTCCCCTACAA 

GTGCCCTCGGGTCTGTCCTCCCCGTCTGGACCCTCCCAGCCACTATCCCTTGCTGGAAGGCT 

CAGCTCTTTGGGGGGTCTGGGGTGACCTCCCCACCTCCTGGAAAACTTTAGGGTATTTTTGC 

GCAAACTCCTTCAGGGTTGGGGGACTCTGAAGGAAACGGGACAAAACCTTAAGCTGTTTTCT 

TAGCCCCTCAGCCAGCTGCCATTAGCTTGGCTCTTAAAGGGCCAGGCCTCCTTTTCTGCCCT 

CTAGCAGGGAGGTTTTCCAACTGTTGGAGGCGCCTTTGGGGCTGCCCCTTTGTCTGGAGTCA 

CTGGGGGCTTCCGAGGGTCTCCCTCGACCCTCTGTCGTCCTGGGATGGCTGTCGGGAGCTGT 

ATCACCTGGGTTCTGTCCCCTGGCTCTGTATCAGGCACTTTATTAAAGCTGGGCCTCAGTGG 

GGTGTGTTTGTCTCCTGCTCTTCTGGAGCCTGGAAGGAAAGGGCTTCAGGAGGAGGCTGTGA 

GG CTGGAGGGACC AGATGGAGGAGGCCAGC AG CT AG CC ATTGC ACACTGGGGTG ATGGGT GG 

GGGCGGTGACTGCCCCAGACTTGGTTTTGTAATGATTTGTACAGGAATAAACACACCTACGC 

TCCGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 34 



MS SNKEQRS AVF VI LFAL I TI L I LYS SNS ANE VFHYGSLRGRSRRP VNLKKWS I TDGYVP I L 
GNKTLPSRCHQCVIVSSSSHLLGTKLGPEIERAECTIRMNDAPTTGYSADVGNKTTYRWAH 
SSVFRVLRRPQEFVNRTPETVFIFWGPPSKMQKPQGSLVRVIQRAGLVFPNMEAYAVSPGRM 
RQFDDLFRGETGKDREKSHSWLSTGWFTMVIAVELCDHVHVYGMVPPNYCSQRPRLQRMPYH 
YYEPKGPDECVTYIQNEHSRKGNHHRFITEKRVFSSWAQLYGITFSHPSWT 

Signal peptide: 

amino acids 1-29 

Transmembrane domain : 

amino acids 9-31 (type II) 

N-glycosylation site. 

amino acids 64-68, 115-119 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 50-54 

Casein kinase II phosphorylation site. 

amino acids 3-7, 29-33, 53-57, 197-201 

Tyrosine kinase phosphorylation site. 

amino acids 253-262 

N-myristoylation site. 

amino acids 37-43, 114-120, 290-294 



3HJX7 O 





WO 00/12708 



PCT/US99/20111 



FIGURE 35 



GTTTCTCATAGTTGGCGTCTTCTAAAGGAAAAACACTAAAATGAGGAACTCAGCGGACCGGGAGCGACGCAGCTT 
GAGGGAAGCATCCCTAGCTGTTGGCGCAGAGGGGCGAGGCTGAAGCCGAGTGGCCCGAGGTGTCTGAGGGGCTGG 
GGCAAAGGTGAAAGAGTTTCAGAACAAGCTTCCTGGAACCCATGACCCATGAAGTCTTGTCGACATTTATACCGT 
CTGAGGGTAGCAGCTCGAAACTAGAAGAAGTGGAGTGTTGCCAGGGACGGCAGTATCTCTTTGTGTGACCCTGGC 
GGCCTATGGGACGTTGGCTTCAGACCTTTGTGATACACCATGCTGCGTGGGACGATGACGGCGTGGAGAGGAATG 
AGGCCTGAGGTCACACTGGCTTGCCTCCTCCTAGCCACAGCAGGCTGCTTTGCTGACTTGAACGAGGTCCCTCAG 
GTCACCGTCCAGCCTGCGTCCACCGTCCAGAAGCCCGGAGGCACTGTGATCTTGGGCTGCGTGGTGGAACCTCCA 
AGGATGAATGTAACCTGGCGCCTGAATGGAAAGGAGCTGAATGGCTCGGATGATGCTCTGGGTGTCCTCATCACC 
CACGGGACCCTCGTCATCACTrGCCCTTAACAACCACACTGTGGGACGGTACCAGTGTGTGGCCCGGATGCCTGCG 
GGGGCTGTGGCCAGCGTGCCAGCCACTGTGACACTAGCCAATCTCCAGGACTTCAAGTTAGATGTGCAGCACGTG 
ATTGAAGTGGATGAGGGAAACACAGCAGTCATTGCCTGCCACCTGCCTGAGAGCCACCCCAAAGCCCAGGTCCGG 
TACAGCGTCAAACAAGAGTGGCTGGAGGCCTCCAGAGGTAACTACCTGATCATGCCCTCAGGGAACCTCCAGATT 
GTGAATGCCAGCCAGGAGGACGAGGGCATGTACAAGTGTGCAGCCTACAACCCAGTGACCCAGGAAGTGAAAACC 
TCCGGCTCCAGCGACAGGCTACGTGTGCGCCGCTCCACCGCTGAGGCTGCCCGCATCATCTACCCCCCAGAGGCC 
CAAACCATCATCGTCACCAAAGGCCAGAGTCTCATTCTGGAGTGTGTGGCCAGTGGAATCCCACCCCCACGGGTC 
ACCTGGGCCAAGGATGGGTCCAGTGTCACCGGCTACAACAAGACGCGCTTCCTGCTGAGCAACCTCCTCATCGAC 
ACCACCAGCGAGGAGGACTCAGGCACCTACCGCTGCATGGCCGACAATGGGGTTGGGCAGCCCGGGGCAGCGGTC 
ATCCTCTACAATGTCCAGGTGTTTGAACCCCCTGAGGTCACCATGGAGCTATCCCAGCTGGTCATCCCCTGGGGC 
CAGAGTGCCAAGCTTACCTGTGAGGTGCGTGGGAACCCCCCGCCCTCCGTGCTGTGGCTGAGGAA'BGCTGTGCCC 
CTCATCTCCAGCCAGCGCCTCCGGCTCTCCCGCAGGGCCCTGCGCGTGCTCAGCATGGGGCCTGAGGACGAAGGC 
GTCTACCAGTGCATGGCCGAGAACGAGGTTGGGAGCGCCCATGCCGTAGTCCAGCTGCGGACCTCCAGGCCAAGC 
ATAACCCCAAGGCTATGGCAGGATGCTGAGCTGGCTACTGGCACACCTCCTGTATCACCCTCCAAACTCGGCAAC 
CCTGAGCAGATGCTGAGGGGGCAACCGGCGCTCCCCAGACCCCCAACGTCAGTGGGGCCTGCTTCCCCGAAGTGT 
CCAGGAGAGAAGGGGCAGGGGGCTCCCGCCGAGGCTCCCATCATCCTCAGCTCGCCCCGCACCTCCAAGACAGAC 
TCATATGAACTGGTGTGGCGGCCTCGGCATGAGGGCAGTGGCCGGGCGCCAATCCTCTACTATGTGGTGAAACAC 
CGCAAGCAGGTCACAAATTCCTCTGACGATTGGACCATCTCTGGCATTCCAGCCAACCAGCACCGCCTGACCCTC 
ACCAGACTTGACCCCGGGAGCTTGTATGAAGTGGAGATGGCAGCTTACAACTGTGCGGGAGAGGGCCAGACAGCC 
ATGGTCACCTTCCGAACTGGACGGCGGCCCAAACCCGAGATCATGGCCAGCAAAGAGCAGCAGATCCAGAGAGAC 
GACCCTGGAGCCAGTCCCCAGAGCAGCAGCCAGCCAGACCACGGCCGCCTCTCCCCCCCAGAAGCTCCCGACAGG 
CCCACCATCTCCACGGCCTCCGAGACCTCAGTGTACGTGACCTGGATTCCCCGTGGGAATGGTGGGTTCCCAATC 
CAGTCCTTCCGTGTGGAGTACAAGAAGCTAAAGAAAGTGGGAGACTGGATTCTGGCCACCAGCGCCATCCCCCCA 
TCGCGGCTGTCCGTGGAGATCACGGGCCTAGAGAAAGGCACCTCCTACAAGTTTCGAGTCCGGGCTCTGAACATG 
CTGGGGGAGAGCGAGCCCAGCGCCCCCTCTCGGCCCTACGTGGTGTCGGGCTACAGCGGTCGCGTGTACGAGAGG 
CCCGTGGCAGGTCCTTATATCACCTTCACGGATGCGGTCAATGAGACCACCATCATGCTCAAGTGGATGTACATC 
CCAGCAAGTAACAACAACACCCCAATCCATGGCTTTTATATCTATTATCGACCCACAGACAGTGACAATGATAGT 
GACTACAAGAAGGATATGGTGGAAGGGGACAAGTACTGGCACTCCATCAGCCACCTGCAGCCAGAGACCTCCTAC 
GACATTAAGATGCAGTGCTTCAATG7VAGGAGGGGAGAGCGAGTTCAGCAACGTGATGATCTGTGAGACCAAAGCT 
CGGAAGTCTTCTGGCCAGCCTGGTCGACTGCCACCCCCAACTCTGGCCCCACCACAGCCGCCCCTTCCTGAAACC 
ATAGAGCGGCCGGTGGGCACTGGGGCCATGGTGGCTCGCTCCAGCGACCTGCCCTATCTGATTGTCGGGGTCGTC 
CTGGGCTCCATCGTTCTCATCATCGTCACCTTCATCCCCTTCTGCTTGTGGAGGGCCTGGTCTAAGCAAAAACAT 
ACAACAGACCTGGGTTTTCCTCGAAGTGCCCTTCCACCCTCCTGCCCGTATACTATGGTGCCATTGGGAGGACTC 
CCAGGCCACCAGGCCAGTGGACAGCCCTACCTCAGTGGCATCAGTGGACGGGCCTGTGCTAATGGGATCCACATG 
AATAGGGGCTGCCCCTCGGCTGCAGTGGGCTACCCGGGCATGAAGCCCCAGCAGCACTGCCCAGGCGAGCTTCAG 
CAGC^GAGTGACACCAGCAGCCTGCTGAGGCAGACCCATCTTGGCAATGGATATGACCCCCAAAGTCACCAGATC 
ACGAGGGGTCCCAAGTCTAGCCCGGACGAGGGCTCTTTCTTATACACACTGCCCGACGACTCCACTCACCAGCTG 
CTGCAGCCCCATCACGACTGCTGCCAACGCCAGGAGCAGCCTGCTX3CTGTGGGCCAGTCAGGGGTGAGGAGAGCC 
CCCGACAGTCCTGTCCTGGAAGCAGTGTGGGACCCTCCATTTCACTCAGGGCCCCCATGCTGCTTGGGCCTTGTG 
CCAGTTGAAGAGGTGGACAGTCCTGACTCCTGCCAAGTGAGTGGAGGAGACTGGTGTCCCCAGCACCCCGTAGGG 
GCCTACGTAGGACAGGAACCrGGAATGCAGCTCTCCCCGGGGCCACTGGTGCGTGTGTCTTTTGAAACACCACCT 
CTCACAATT TAG GCAGAAGCTGATATCCCAGAAAGACTATATATTGTTTTTTTTTTAAA 

AGAGACAGAGAAAATTGGTATTTATTTTTCTATTATAGCCATATTTATATATTTATGCACTTGTAAATAAATGTA 
TATGTTTTATAATTCTGGAGAGACATAAGGAGTCCTACCCGTTGAGGTTGGAGAGGGAAAATAAAGAAGCTGCCA 
CCTAACAGGAGTCACCCAGGAAAGCACCGCACAGGCTGGCGCGGGACAGACTCCTAACCTGGGGCCTCTGCAGTG 
GCAGGCGAGGCTGCAGGAGGCCCACTVGATAAGCTGGC^GAGGAAGGATCCCAGGCACATGGTTCATCACGAGCA 
TGAGGGAACAGCAAGGGGCACGGTATCACAGCCTGGAGACACCCACACAGATGGCTGGATCCGGTGCTACGGGAA 
ACATTTTCCTAAGATGCCCATGAGAACAGACCAAGATGTGTACAGCACTATGAGCATTAAAAAACCTTCCAGAAT 
CAATAATCCGTGGCAACATATOTCTGTAAAAAC^AA(^CTGTAACTTCTAAATAAATGTTTAG 
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MLRGTMTAWRGMRPEVTLACLLLATAGCFADLNEVPQVTVQPASTVQKPGGTVILGCVVEPP 
RMNVTWRLNGKELNGSDDALGVLI THGTLV ITALNNHTVGRYQCVARMPAGAVASVPATVTL 
ANLQDFKLDVQHVIEVDEGNTAVIACHLPESHPKAQVRYSVKQEWLEASRGNYLIMPSGNLQ 
I VNASQEDEGMYKCAAYNPVTQEVKTSGSSDRLRVRRSTAEAARI I YPPEAQTI IVTKGQSL 
ILECVASGIPPPRVTWAKDGSSVTGYNKTRFLLSNLLIDTTSEEDSGTYRCMADNGVGQPGA 
AVILYNVQVFEPPEVTMELSQLVIPWGQSAKLTCEVRGNPPPSVLWLRNAVPLISSQRLRLS 
RRALRVLSMGPEDEGVYQCMAENEVGSAHAWQLRTSRPSITPRLWQDAELATGTPPVSPSK 
LGNPEQMLRGQPALPRPPTSVGPASPKCPGEKGQGAPAEAPIILSSPRTSKTDSYELVWRPR 
HEGSGRAP ILYYVVKHRKQVTNSSDDWT ISGI PANQHRLTLTRLDPGSLYEVEMAAYNCAGE 
GQTAMVTFRTGRRPKPEIMASKEQQIQRDDPGASPQSSSQPDHGRLSPPEAPDRPTISTASE 
TSVYVTWI PRGNGGFP IQSFRVEYKKLKKVGDWI LATS AI PPSRLSVE ITGLEKGTS YKFRV 
RAIjNMLGESEPSAPSRPYVVSGYSGRVYERPVAGPYITFTDAVNE^ 

PIHGFYIYYRPTDSDNDSDYKKDMVEGDKYWHSISHLQPETSYDIKMQCFNEGGESEFSNVM 
ICETKARKSSGQPGRLPPPTIAPPQPPLPETIERPVGTGAMVARSSDLPYLIVGVVLGSIVL 
IIVTFIPFCLWRAWSKQKHTTDLGFPRSALPPSCPYTMVPLGGLPGHQASGQPYLSGISGRA 
CANGIHMNRGCPSAAVGYPGMKPQQHCPGELQQQSDTSSLLRQTHLGNGYDPQSHQITRGPK 
SSPDEGSFLYTLPDDSTHQLLQPHHDCCQRQEQPAAVGQSGVRRAPDSPVLEAVWDPPFHSG 
PPCCLGLVPVEEVDSPDSCQVSGGDWCPQHPVGAYVGQEPGMQLSPGPLVRVSFETPPLTI 

Signal peptide: 

amino acids 1-3 0 

Transmembrane domain: 

amino acids 16-30 (type II), 854-879 
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CGGGAGGCTGGGTCGTCATGATCCGGACCCCATTGTCGGCCTCTGCCCATCGCCTGCTCCTC 
CCAGGCTCCCGCGGCCGACCCCCGCGCAACATGCAGCCCACGGGCCGCGAGGGTTCCCGCGC 
GCTCAGCCGGCGGTATCTGCGGCGTCTGCTGCTCCTGCTACTGCTGCTGCTGCTGCGGCAGC 
CCGTAACCCGCGCGGAGACCACGCCGGGCGCCCCCAGAGCCCTCTCCACGCTGGGCTCCCCC 
AGCCTCTTCACCACGCCGGGTGTCCCCAGCGCCCTCACTACCCCAGGCCTCACTACGCCAGG 
CACCCCCAAAACCCTGGACCTTCGGGGTCGCGCGCAGGCCCTGATGCGGAGTTTCCCACTCG 
TGGACGGC CAC AATGACC TGCC CC AGGT CC TG AGACAGCGTTACAAGAATGTG CTTCAGGAT 
GTTAACCTGCGAAATTTCAGCCATGGTCAGACCAGCCTGGACAGGCTTAGAGACGGCCTCGT 
GGGTGCCCAGTTCTGGTCAGCCTCCGTCTCATGCCAGTCCCAGGACCAGACTGCCGTGCGCC 
TCGCCCTGGAGCAGATTGACCTCATTCACCGCATGTGTGCCTCCTACTCTGAACTCGAGCTT 
GTGACCTCAGCTGAAGGTCTGAACAGCTCTCAAAAGCTGGCCTGCCTCATTGGCGTGNAGGG 
TGGTCACTCACTGGACAGCAGCCTCTCTGTGCTGCGCAGTTTCTATGTGCTGGGGGTGCGCT 
ACCTGACACTTACCTTCACCTGCAGTACACCATGGGCAGAGAGTTCCACCAAGTTCAGACAC 
CACATGT ACACCAACGTCAGCGGATTGACAAG CTTTGGTG AGAAAG TAGTAGAGGAGTTGAA 
CCGCCTGGGCATGATGATAGATTTGTCCTATGCATCGGACACCTTGATAAGAAGGGTCCTGG 
AAGTGTCTCAGGCTCCTGTGATCTTCTCCCACTCAGCTGCCAGAGCTGTGTGTGACAATTTG 
TTGAATGTTC CCGATGATAT CCTG CAGCTTCTGAAGAACGGTGGCATCGTGATGGTGACACT 
GTCCATGGGGGTGCTGCAGTGCAACCTGCTTGCTAACGTGTCCACTGTGGCAGATCACTTTG 
AC CAC ATCAGGG CAGT C ATTGG AT CTGAGTTC ATCGGGATTGGTGGAAATTATGACGGGACT 
GGCCGGTTCCCTCAGGGGCTGGAGGATGTGTCCACATACCCAGTCCTGATAGAGGAGTTGCT 
GAGTCGTASCTGGAGCGAGGAAGAGCTTCAAGGTGTCCTTCGTGGAAACCTGCTGCGGGTCT 
TCAGACAAGTGGAAAAGGTGAGAGAGGAGAGCAGGGCGCAGAGCCCCGTGGAGGCTGAGTTT 
CCATATGGGCAACTGAGCACATCCTGCCACTCCCACCTCGTGCCTCAGAATGGACACCAGGC 
TACTCATCTGGAGGTGACCAAGCAGCCAACCAATCGGGTCCCCTGGAGGTCCTCAAATGCCT 
CCCCATACCTTGTTCCAGGCCTTGTGGCTGCTGCCACCATCCCAACCTTCACCCAGTGGCTC 
TGCTGACACAGTCGGTCCCCGCAGAGGTCACTGTGGCAAAGCCTCACAAAGCCCCCTCTCCT 
AGTTCATTCACAAGCATATGCTGAGAATAAACATGTTACACATGGAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA59817 
xsubunit 1 of 1, 487 aa, 1 stop, 2 unknown 
><MW: 53569.32, pi: 7.68, NX(S/T): 5 

MQPTGREGSRALSRRYLRRLLLLLLLLLLRQPVTRAETTPGAPRALSTLGSPSLFTTPGVPS 
ALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVDGHNDLPQVLRQRYKNVLQDVNLRNFSHGQ 
TSLDRLRDGLVGAQFWSASVSCQSQDQTAVRLALEQIDLIHRMCASYSELELVTSAEGLNSS 
QKLACLIGVXGGHSLDSSLSVLRSFYVLGVRYLTLTFTCSTPWAESSTKFRHHMYTNVSGLT 
S FGEKWEELNRLGMMIDLS YASDTL IRRVLEVSQAPVI FSHSAARAVCDNLLNVPDD I LQL 
LKNGGIVMVTLSMGVLQCNLLANVSTVADHFDHIRAVIGSEFIGIGGNYDGTGRFPQGLEDV 
STYPVLIEELLSRXWSEEELQGVLRGNLLRVFRQVEKVREESRAQSPVEAEFPYGQLSTSCH 
SHLVPQNGHQATHLEVTKQPTNRVPWRSSNAS P YLVPGLVAAATI PTFTQWLC 

Important features of the protein: 
Signal peptide: 

amino acids 1-36 

Transmembrane domain: 

amino acids 313-331 

N-glycosylation sites. 

amino acids 119-122, 184-187, 243-246 and 333-336 
N-myristoylation sites. 

amino acids 41-46, 59-64, 73-78, 133-138, 182-187, 194-199, 324- 
329, 354-359, 357-362, 394-399, 427-432 and 472-477. 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 13 6-146 
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TGCTAGGCTCTGTCCCACAATGCACCCGAGAGCAGGAGCTGAAAGCCTCTAACACCCACAGA 
TCCCTCTATGACTGCAATGTGAGGTGTCCGGCTTTGCTGGCCCAGCAAGCCTGATAAGCATG 
AAGCTCTTATCTTTGGTGGCTGTGGTCGGGTGTTTGCTGGTGCCCCCAGCTGAAGCCAACAA 
GAGTTCTGAAGATATCCGGTGCAAATGCATCTGTCCACCTTATAGAAACATCAGTGGGCACA 
TTTACAACCAGAATGTATCCCAGAAGGACTGCAACTGCCTGCACGTGGTGGAGCCCATGCCA 
GTGCCTGGCCATGACGTGGAGGCCTACTGCCTGCTGTGCGAGTGCAGGTACGAGGAGCGCAG 
CACCACCACCATCAAGGTCATCATTGTCATCTACCTGTCCGTGGTGGGTGCCCTGTTGCTCT 
ACATGGCCTTCCTGATGCTGGTGGACCCTCTGATCCGAAAGCCGGATGCATACACTGAGCAA 
CTGCACAATGAGGAGGAGAATGAGGATGCTCGCTCTATGGCAGCAGCTGCTGCATCCCTCGG 
GGGACCCCGAGCAAACACAGTCCTGGAGCGTGTGGAAGGTGCCCAGCAGCGGTGGAAGCTGC 
AGGTGCAGGAGCAGCGGAAGACAGTCTTCGATCGGCACAAGATGCTCAGCTAGATGGGCTGG 
TGTGGTTGGGTCAAGGCCCCAACACCATGGCTGCCAGCTTCCAGGCTGGACAAAGCAGGGGG 
CTACTTCTCCCTTCCCTCGGTTCCAGTCTTCCCTTTAAAAGCCTGTGGCATTTTTCCTCCTT 
CTCCCTAACTTTAGAAATGTTGTACTTGGCTATTTTGATTAGGGAAGAGGGATGTGGTCTCT 
GATCTCTGTTGTCTTCTTGGGTCTTTGGGGTTGAAGGGAGGGGGAAGGCAGGCCAGAAGGGA 
ATGGAGACATTCGAGGCGGCCTCAGGAGTGGATGCGATCTGTCTCTCCTGGCTCCACTCTTG 
CCGCCTTCCAGCTCTGAGTCTTGGGAATGTTGTTACCCTTGGAAGATAAAGCTGGGTCTTCA 
GGAACTCAGTGTCTGGGAGGAAAGCATGGCCCAGCATTCAGCATGTGTTCCTTTCTGCAGTG 
GTTCTTATCACCACCTCCCTCCCAGCCCCGGCGCCTCAGCCCCAGCCCCAGCTCCAGCCCTG 
AGGACAGCTCTGATGGGAGAGCTGGGCCCCCTGAGCCCACTGGGTCTTCAGGGTGCACTGGA 
AGCTGGTGTTCGCTGTCCCCTGTGCACTTCTCGCACTGGGGCATGGAGTGCCCATGCATACT 
CTGCTGCCGGTCCCCTCACCTGCACTTGAGGGGTCTGGGCAGTCCCTCCTCTCCCCAGTGTC 
CACAGTCACTGAGCCAGACGGTCGGTTGGAACATGAGACTCGAGGCTGAGCGTGGATCTGAA 
CACCACAGCCCCTGTACTTGGGTTGCCTCTTGTCCCTGAACTTCGTTGTACCAGTGCATGGA 
GAGAAAATTTTGTCCTCTTGTCTTAGAGTTGTGTGTAAATCAAGGAAGCCATCATTAAATTG 
TTTTATTTCTCTCA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA60278 
<subunit 1 of 1, 183 aa, 1 stop 
<MW: 20574, pi: 6.60, NX(S/T): 3 

MKLLS LVAWGCLLVP P AE ANKS S ED I RCKC I C P P YRN I SGH I YNQNVSQKDCNCLHVVEPM 
PVPGHDVEAYCLLCECRYEERSTTTIKVIIVIYLSWGALLLYMAFLMLVDPLIRKPDAYTE 
QLHNEEENEDARSMAAAAASLGGPRANTVLERVEGAQQRWKLQVQEQRKTVFDRHKMLS 

Important features : 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 90-112 

N-glycosylation sites. 

amino acids 21-24, 38-41 and 47-50 
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AGCGGGTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCTGGGTCCCCGA 

ACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGGGGCCCCGTGTTGGGGTCCTCCC 

TCCCTTTGCATTCCCACCCCTCCGGGCTTTGCGTCTTCCTGGGGACCCCCTCGCCGGGAGATGGCCGCGTTGATG 

CGGAGCAAGGATTCGTCCTGCTGCCTGCTCCTACTGGCCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGT 

TCGCGGGCCAAACTCAACTCCATCAAGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCG 

GGCATGTACCAAGGACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGCAGT 

GATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGCATGGTGTGTCGGAGA 

AAAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGCTGCAATAATGGCATCTGTATCCCAGTT 

ACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGGATGGTACTCGGCACAGAGATCGAAACCACGGTCATTAC 

TCAAACCATGACTTGGGATGGCAGAATCTAGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGA 

GACCCCTGCCTACGATCATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATTTCTGGACCAAAATCTGCAAA 

CCAGTGCTCCATC^GGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCTCATGGGCTGGAAATTTTCC^ 

TGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCCTCCAAAGCCAGACTCCATGTG 

TGTCAGAAAATT TGA TCACCATTGAGGAACATCATCAATTGCAGACTGTGAAGTTGTGTATTTAATGCATTATAG 

CATGGTGGAAAATAAGGTTCAGATGCAGAAGAATGGCTAAAATAAGAAACGTGATAAGAATATAGATGATCACAA 

AAAGGGAGAAAGAAAACATGAACTGAATAGATTAGAATGGGTGACAAATGCAGTGCAGCCAGTGTTTCCATTATG 

CAACTTGTCTATGTAAATAATGTACAC^TTTGTGGAAAATGCTATTATTAAGAGAACAAGCACACAGTGGAAATT 

ACTGATGAGTAGC^TGTGACTTTCCAAGAGTTTAGGTTGTGCTGGAGGAGAGGTTTCCTTCAGATTGCTGATTGC 

TTATACAAATAAC CTACATG C CAG ATT TCTATT CAACGTTAGAGTTTAACAAAAT ACT C CTAG AATAACTTGTTA 

TACAATAGGTTCTAAAAATAAAATTGCTAAACAAGAAATGAAAACATGGAGCATTGTTAATTTACAACAGAAAAT 

TACCTTTTGATTTGTAACACTACTTCTGCTGTTCAATCAAGAGTCTTGGTAGATAAGAAAA71AATCAGTCAATAT 

TTC CAAATAATTGCAAAATAATGG CC AGTTGTTTAGGAAGGC CTTTAGGAAGACAAATAAATAACAAACAAACAG 

C CACAAAT ACTTTTTTTTCAAAATTTTAGTTTTACCTGTAATTAATAAGAAC TG ATACAAGACAAAAACAGTTC C 

TTCAGATTCTACGGAATGACAGTATATCTCTCTTTATCCTATGTGATTCCTGCTCTGAATGCATTATATTTTCCA 

AACTATACCCATAAATTGTGACTAGTAAAATACTTACAGAGAGCAGAATTTTCACAGATGGCAAAAAAATTTAAA 

GATGTCCAATATATGTGGGAAAAGAGCTAACAGAGAGATCATTATTTCTTAAAGATTGGCCATAACCTATATTTT 

GATAGAATTAGATTGGTAAATACATGTATTCATACATACTCTGTGGTAATAGAGACTTAAGCTGGATCTGTACTG 

CACTGGAGTAAGCAAGAAAATTGGGAAAACTTTTTCGTTTGTTCAGGTTTTGGCAAC^CATAGATCATATO 

AGGCAGAAGTTGGCTGTTCATCITTGAAACCAGGGGATGC^^ 

CATAATATTTACTATGCAGATGAATTC^GTGTGAGGTCCTGTGTCCGTACTATCCTCAAATTATTTATTTTATAG 
TGC TGAGATC C TCAAATAAT C TCAATTT CAG G AGG TTT CACAAAATGTAC TC C TG AAGT AGACAGAGT AGTGAGG 
TTTCATTGCCCTCTATAAGCTTCTGACTAGCCAATGGCATCATCCAATTTTCTT^ 

CTTTATTGCCAAAGGGCTAGTTTCGGTTTTCTGCAGCCATTGCGGTTAAAAAATATAAGTAGGATAACTTGTAAA 

ACCTGCATATTGCTAATCTATAGACACCACAGTTTCTAAATTCTTTGAAACCACTTTACTACTTTTTTTAAACTT 

AACTCAGTTCTAAATACTTTGTCTGGAGCACAAAACAATAAAAGGTTATCTTATAGTCGTGACTTTAAACTTTTG 

TAGACCACAATTCACTTTTTAGTTTTCTTTTACTTAAATCCCATCTGCAGTCTCAAATTTAAGTTCTCCCAGTAG 

AGATTGAGTTTGAGCCTGTATATCTATTAAAAATTTCAACTTCCCACATATATTTACTAAGATGATTAAGACTTA 

CATTTTCTGCACAGGTCTGCAAAAAGAAAAATTATAAACTAGTCCATCGAAGAACC^ 

TGCTATAAGCTTGTGAAATGAAAATGGAACATTTCAATCAAAC7VTTTCCTATATAACAATTATTATATO 

TTGGTTTCTGCAATATTTTTCTTATGTCCACCCTTTTAAAAATTATTATTTGAA 

TTAATGAGATGTATTTTCTTATAGAGATATTTCTTACAGAAAGCTTTGTAGCAGAATATATTTGCAGCTATTGAC 

TTTGTAATTTAGGAAAAATGTATAATAAGATAAAATCTATTAAATTTTTCTCCT 

AAAAAAAAAAAAAAAAAAAA 
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MAALMRSKDSSCCLLLLAAVLMVESSQIGSSRAKLNSIKSSLGGETPGQAANRSAGMYQGLA 
FGGSKKGKNLGQAYPCSSDKECEVGRYCHSPHQGSSACMVCRRKKKRCHRDGMCCPSTRCNN 
G I C I PVTES ILTPHI P ALDGTRHRDRNHGH YS NHDLGWQNLGRPHTKMSH I KGHEGDPCLRS 
SDCIEGFCCARHFWTKICKPVLHQGEVCTKQRKKGSHGLEIFQRCDCAKGLSCKVWKDATYS 
SKARLHVCQKI 

Signal peptide: 

amino acids 1-25 
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TCTCAATCTGCTGACCTCGTGATCCGCCTGACCTTGTAATCCACCTACCTTGGCCTCCCAAA 
GTGTTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCAACATCACGTTTTTAAAAATTGATT 
TCTTCAAATTCATGGC AAAT AT TT C C CTTC C CTTTAACTT C T TATGT C AG AATG AGGAAGGA 
TAGCTG CATTTATTTAGT C AGTTT TC AT TG CAT AGT AAT ATT TT C ATGTAGTATTTTCTAAG 
TTATATTTTAGTAATTCATATGTTTTAGATTATAGGTTTTAACATACTTGTGAAAATACTTG 
ATGTGTTTTAAAGCCTTGGGCAGAAATTCTGTATTGTTGAGGATTTGTTCTTTTATCCCCCT 
TTTAAAGTCATCCGTCCTTGGCTCAGGATTTGGAGAGCTTGCACCACCAAAAATGGCAAACA 
TCACCAGCTCCCAGATTTTGGACCAGTTGAAAGCTCCGAGTTTGGGCCAGTTTACCACCACC 
CCAAGT AC AC AG CAGAAT AG TAC AAGTC AC CCTACAACTACTACTT CTTGGGAC CT CAAG C C 
CCCAACATCCCAGTCCTCAGTCCTCAGTCATCTTGACTTCAAATCTCAACCTGAGCCATCCC 
CAGTTCTTAGCCAGTTGAGCCAGCGACAACAGCACCAGAGCCAGGCAGTCACTGTTCCTCCT 
CCTGGTTTGGAGTCCTTTCCTTCCCAGGCAAAACTTCGAGAATCAACACCTGGAGACAGTCC 
CTCCACTGTGAACAAGCTTTTGCAGCTTCCCAGCACGACCATTGAAAATATCTCTGTGTCTG 
TCCACCAGCCAC AGCC CAAACACAT C AAACTTGCTAAG CGGCGG AT AC C C CCAGCTTCTAAG 
AT CC CAGCTTCTGCAGTGGAAATG CC TGGTT CAG CAGATGTC AC AGGATT AAATGTGCAGTT 
TGGGGCTCTGGAATTTGGGTCAGAACCTTCTCTCTCTGAATTTGGATCAGCTCCAAGCAGTG 
AAAATAGT AATC AG ATT CCC AT CAG CTTGTATTCGAAGTCTTTAAGTGAGC CTTTGAATACA 
TCTTTATCAATGACCAGTGCAGTACAGAACTCCACATATACAACTTCCGTCATTACCTCCTG 
CAGTCTGACAAGCTCATCACTGAATTCTGCTAGTCCAGTAGCAATGTCTTCCTCTTATGACC 
AGAGTT CTGTGCATAACAGGATC CCATJVCCAAAGCCCTGTGAGTTCATCAGAGTCAGCTCCA 
GGAAC CAT CATGAATGGACATGGTGGTGGTCGAAGTCAGCAGAC ACTAGACAGT AAGTATAG 
CAGC AAGCTACT CTTGT CATGG CTGGTGCCAACCAAAC AGAGGAAG AGGATAGCTCACGTGA 
TGTGGAAAACACCAGTTGGTCAATGGCTCATTCGTTAAAAAGCAGCCCTTTTGCTTTTTTGT 
TTTTGG AC CAGGTGTTGG CTGTGGTGTTATTAGAAATGTCTTAACC ACAG CAAG AAGGAGGT 
GGTGGTCT CATATTCTTCTG CCCTAATC AGACTG CAC CACAAGT GC AG CATACAGTATGC AT 
TTTAAAGATGCTTGGGCCAGGCGGGGTGGCTGATGCCCATAATCCCAGTGCTTTGGGGGGCC 
AAGGCAGGCAGATTGCCCAAGCTCAGGAGTTTGAGACCACCCTGGGCAACATGGTGAAACTC 
TGTCTCTACTAAAATACGAAAAACTAGCCGGGTGTGGTGGCGGCGCGTGCCTGTAATCCCAG 
CTACTTGGGAGGCTGAGGCACAAGAATCGCTTGAGCCAGCTTGGGCTACAAAGTGAGACTCC 
GTCTGAAAAGA 
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MCFKALGRNSVLLRICSFIPLLKSSVLGSGFGELAPPKMANITSSQILDQLKAPSLGQFTTT 
PSTQQNSTSHPTTTTSWDLKPPTSQSSVLSHLDFKSQPEPSPVLSQLSQRQQHQSQAVTVPP 
PGLESFPSQAKLRESTPGDSPSTWKLLQLPSTTIENISVSVHQPQPKHIKLAKRRIPPASK 
I PASAVEMPGSADVTGLNVQFGALEFGS EPSLSEFGSAPS SENSNQ I P I SLYSKSLSEPLNT 
SLSMTSAVQNSTYTTSVITSCSLTSSSLNSASPVAMSSSYDQSSVHNRIPYQSPVSSSESAP 
GTIMNGHGGGRSQQTLDSKYSSKLLLSWLVPTKQRKRIAHVMWKTPVGQWLIR 

Signal peptide: 

amino acids 1-24 
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GCCGAGTGGGACAAAGCCTGGGGCTGGGCGGGGGCCATGGCGCTGCCATCCCGAATCCTGCT 
TTGGAAACTTGTGCTTCTGCAGAGCTCTGCTGTTCTCCTGCACTCAGCGGTGGAGGAGACGG 
ACGCGGGGCTGTACACCTGCAACCTGCACCATCACTACTGCCACCTCTACGAGAGCCTGGCC 
GTCCGCCTGGAGGTCACCGACGGCCCCCCGGCCACCCCCGCCTACTGGGACGGCGAGAAGGA 
GGTGCTGGCGGTGGCGCGCGGCGCACCCGCGCTTCTGACCTGCGTGAACCGCGGGCACGTGT 
GGACCGACCGGCACGTGGAGGAGGCTCAACAGGTGGTGCACTGGGACCGGCAGCCGCCCGGG 
GTCCCGCACGACCGCGCGGACCGCCTGCTGGACCTCTACGCGTCGGGCGAGCGCCGCGCCTA 
CGGGCCCCTTTTTCTGCGCGACCGCGTGGCTGTGGGCGCGGATGCCTTTGAGCGCGGTGACT 
TCTCACTGCGTATCGAGCCGCTGGAGGTCGCCGACGAGGGCACCTACTCCTGCCACCTGCAC 
CACCATTACTGTGGCCTGCACGAACGCCGCGTCTTCCACCTGACGGTCGCCGAACCCCACGC 
GGAGCCGCCCCCCCGGGGCTCTCCGGGCAACGGCTCCAGCCACAGCGGCGCCCCAGGCCCAG 
ACCCCACACTGGCGCGCGGCCACAACGTCATCAATGTCATCGTCCCCGAGAGCCGAGCCCAC 
TTCTTCCAGCAGCTGGGCTACGTGCTGGCCACGCTGCTGCTCTTCATCCTGCTACTGGTCAC 
TGTCCTCCTGGCCGCCCGCAGGCGCCGCGGAGGCTACGAATACTCGGACCAGAAGTCGGGAA 
AGTCAAAGGGGAAGGATGTTAACTTGGCGGAGTTCGCTGTGGCTGCAGGGGACCAGATGCTT 
TACAGG AGTG AGGAC ATC CAGCTAGATT ACAAAAACAACAT C CTGAAGGAGAGGG CGGAGCT 
GGCCCACAGCCCCCTGCCTGCCAAGTACATCGACCTAGACAAAGGGTTCCGGAAGGAGAACT 
GCAA ATAG GGAGGCCCTGGGCTCCTGGCTGGGCCAGCAGCTGCACCTCTCCTGTCTGTGCTC 
CTCGGGGCATCTCCTGATGCTCCGGGGCTCACCCCCCTTCCAGCGGCTGGTCCCGCTTTCCT 
GGAATTTGGCCTGGGCGTATGCAGAGGCCGCCTCCACACCCCTCCCCCAGGGGCTTGGTGGC 
AGCATAGCCCCCACCCCTGCGGCCTTTGCTCACGGGTGGCCCTGCCCACCCCTGGCACAACC 
AAAATCCCACTGATGCCCATCATGCCCTCAGACCCTTCTGGGCTCTGCCCGCTGGGGGCCTG 
AAGACATTCCTGGAGGACACTC CCATCAGAAC CTGGCAGCCCCAAAACTGGGGTCAGCCTCA 
GGGCAGGAGTCCCACTCCTCCAGGGCTCTGCTCGTCCGGGGCTGGGAGATGTTCCTGGAGGA 
GGAGACTCCCATCAGAACTTGGCAGCCTTGAAGTTGGGGTCAGCCTCGGCAGGAGTCCCACT 
CCTCCTGGGGTGCTGCCTGCCACCAAGAGCTCCCCCACCTGTACCACCATGTGGGACTCCAG 
GCACCATCTGTTCTCCCCAGGGACCTGCTGACTTGAATGCCAGCCCTTGCTCCTCTGTGTTG 
CTTTGGGCCACCTGGGGCTGCACCCCCTGCCCTTTCTCTGCCCCATCCCTACCCTAGCCTTG 
CTCTCAGCCACCTTGATAGTCACTGGGCTCCCTGTGACTTCTGACCCTGACACCCCTCCCTT 
GGACTCTGCCTGGGCTGGAGTCTAGGGCTGGGGCTACATTTGGCTTCTGTACTGGCTGAGGA 
CAGGGGAGGGAGTGAAGTTGGTTTGGGGTGGCCTGTGTTGCCACTCTCAGCACCCCACATTT 
GCATCTGCTGGTGGACCTGCCACCATCACAATAAAGTCCCCATCTGATTTTTAAAAAAAAAA 
AAAAA 
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< /usr/seqdb2 /sst /DNA/Dnaseqs . min/ ss . DNA6 0618 
<subunit 1 of 1, 341 aa, 1 stop 
<MW: 38070, pi: 6.88, NX(S/T): 1 

MALPSRILLWKLVLLQSSAVLLHSAVEETDAGLYTCNLHHHYCHLYESLAVRLEVTDGPPAT 
PAYWDGEKEVLAVARGAPALLTCVNRGHWTDRHVEEAQ 

YASGERRAYGPLFLRDRVAVGADAFERGDFSLRIEPLEVADEGTYSCHLHHHYCGLHERRVF 
HLTVAEPHAEPPPRGSPGNGSSHSGAPGPDPTLARGHNVINVIVPESRAHFFQQLGYVLATL 
LLFILLLVTVLLAARRRRGGYEYSDQKSGKSKGKDVNLAEFAVAAGDQMLYRSEDIQLDYKN 
N I LKERAELAHS PLPAKY I DLDKGFRKENCK 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 237-262 

N-glycosylation site. 

amino acids 205-208 

Cell attachment sequence. 

amino acids 151-154 

Coproporphyrinogen XIX oxidase proteins. 

amino acids 115-140 
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FIGURE 47 

CGCCGGAGGCAGCGGCGGCGTGGCGCAGCGGCGACATGGCCGTTGTCTCAGAGGACGACTTT 
CAGCACAGTTCAAACTCCACCTACGGAACCACAAGCAGCAGTCTCCGAGCTGACCAGGAGGC 
ACTGCTTGAGAAGCTGCTGGACCGCCCGCCCCCTGGCCTGCAGAGGCCCGAGGACCGCTTCT 
GTGGCACATACATCATCTTCTTCAGCCTGGGCATTGGCAGTCTACTGCCATGGAACTTCTTT 
ATCACTGCCAAGGAGTACTGGATGTTCAAACTCCGCAACTCCTCCAGCCCAGCCACCGGGGA 
GGACCCTGAGGGCTCAGACATCCTGAACTACTTTGAGAGCTACCTTGCCGTTGCCTCCACCG 
TGCCCTCCATGCTGTGCCTGGTGGCCAACTTCCTGCTTGTCAACAGGGTTGCAGTCCACATC 
CGTGTCCTGGCCTCACTGACGGTCATCCTGGCCATCTTCATGGTGATAACTGCACTGGTGAA 
GGTGGACACTTCCTCCTGGACCCGTGGTTTTTTTGCGGTCACCATTGTCTGCATGGTGATCC 
TCAGCGGTGCCTCCACTGTCTTCAGCAGCAGCATCTACGGCATGACCGGCTCCTTTCCTATG 
AGGAACTCCCAAGCACTGATATCAGGAGGAGCCATGGGCGGGACGGTCAGCGCCGTGGCCTC 
ATTGGTGGACTTGGCTGCATCCAGTGATGTGAGGAACAGCGCCCTGGCCTTCTTCCTGACGG 
CCACCATCTTCCTCGTGCTCTGCATGGGACTCTACCTGCTGCTGTCCAGGCTGGAGTATGCC 
AGGTACTACATGAGGCCTGTTCTTGCGGCCCATGTGTTTTCTGGTGAAGAGGAGCTTCCCCA 
GGACTCCCTCAGTGCCCCTTCGGTGGCCTCCAGATTCATTGATTCCCACACACCCCCTCTCC 
GCCCCATCCTGAAGAAGACGGCCAGCCTGGGCTTCTGTGTCACCTACGTCTTCTTCATCACC 
AGCCTCATCTACCCCGCCGTCTGCACCAACATCGAGTCCCTCAACAAGGGCTCGGGCTCACT 
GTGGACCACCAAGTTTTTCATCCCCCTCACTACCTTCCTCCTGTACAACTTTGCTGACCTAT 
GTGGCCGGCAGCTCACCGCCTGGATCCAGGTGCCAGGGCCCAACAGCAAGGCGCTCCCAGGG 
TTCGTGCTCCTCCGGACCTGCCTCATCCCCCTCTTCGTGCTCTGTAACTACCAGCCCCGCGT 
CCACCTGAAGACTGTGGTCTTCCAGTCCGATGTGTACCCCGCACTCCTCAGCTCCCTGCTGG 
GGCTCAGCAACGGCTACCTCAGCACCCTGGCCCTCCTCTACGGGCCTAAGATTGTGCCCAGG 
GAGCTGGCTGAGGCCACGGGAGTGGTGATGTCCTTTTATGTGTGCTTGGGCTTAACACTGGG 
CTCAGCCTGCTCTACCCTCCTGGTGCACCTCATCTAGAAGGGAGGACACAAGGACATTGGTG 
CTTCAGAGCCTTTGAAGATGAGAAGAGAGTGCAGGAGGGCTGGGGGCCATGGAGGAAAGGCC 
TAAAGTTTCACTTGGGGACAGAGAGCAGAGCACACTCGGGCCTCATCCCTCCCAAGATGCCA 
GTGAGCCACGTCCATGCCCATTCCGTGCAAGGCAGATATTCCAGTCATATTAACAGAACACT 
CCTGAGACAGTTGAAGAAGAAATAGCACAAATCAGGGGTACTCCCTTCACAGCTGATGGTTA 
ACATTCCACCTTCTTTCTAGCCCTTCAAAGATGCTGCCAGTGTTCGCCCTAGAGTTATTACA 
AAGCCAGTGCCAAAACCCAGCCATGGGCTCTTTGCAACCTCCCAGCTGCGCTCATTCCAGCT 
GACAGCGAGATGCAAGCAAATGCTCAGCTCTCCTTACCCTGAAGGGGTCTCCCTGGAATGGA 
AGTCCCCTGGCATGGTCAGTCCTCAGGCCCAAGACTCAAGTGTGCACAGACCCCTGTGTTCT 
GCGGGTGAACAACTGCCCACTAACCAGACTGGAAAACCCAGAAAGATGGGCCTTCCATGAAT 
GCTTCATTCCAGAGGGACCAGAGGGCCTCCCTGTGCAAGGGATCAAGCATGTCTGGCCTGGG 
TTTTCAAAAAAAGAGGGATCCTCATGACCTGGTGGTCTATGGCCTGGGTCAAGATGAGGGTC 
TTTCAGTGTTCCTGTTTACAACATGTCAAAGCCATTGGTTCAAGGGCGTAATAAATACTTGC 
GTATTCAAAAA 
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MAWSEDDFQHSSNSTYGTTSSSLRADQEALLEKLLDRPPPGLQRPEDRFCGTYI I FFSLGI 
GSLLPWNFF ITAKE YWMFKLRNSS S PATGEDPEGSD I LNYFES YLAVASTVPSMLCLVANFL 
L VNR VAVH I R VLAS LTV I LA I FMV I T AL VK VDT S S WTRG F F AVT I VCMV I LS GAS TVFS SSI 
YGMTGSFPMRNSQALISGGAMGGTVSAVASLVDLAASSDVRNSALAFFLTATIFLVLCMGLY 
LLLSRLEYARYYMRPVLAAHVFSGEEELPQDSLSAPSVASRFIDSHTPPLRPILKKTASLGF 
CVTYVFFITSLIYPAVCTNIESLNKGSGSLWTTKFFIPLTTFLLYNFADLCGRQLTAWIQVP 
GPNSKALPGFVLLRTCLIPLFVLCNYQPRVHLKTVVFQSDVYPALLSSLLGLSNGYLSTLAL 
LYGPKIVPRELAEATGWMSFYVCLGLTLGSACSTLLVHLI 

Transmembrane domain: 

amino acids 50-74 (type II), 105-127, 135-153, 163-183, 228-252, 
305-330, 448-472 
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GACAGTGGAGGGCAGTGGAGAGGACCGCGCTGTCCTGCTGTCACCAAGAGCTGGAGACACCA 
TCTCCCACCGAGAGTCATGGCCCCATTGGCCCTGCACCTCCTCGTCCTCGTCCCCATCCTCC 
TCAGCCTGGTGGCCTCCCAGGACTGGAAGGCTGAACGCAGCCAAGACCCCTTCGAGAAATGC 
ATGCAGGATCCTGACTATGAGCAGCTGCTCAAGGTGGTGACCTGGGGGCTCAATCGGACCCT 
GAAGCCCCAGAGGGTGATTGTGGTTGGCGCTGGTGTGGCCGGGCTGGTGGCCGCCAAGGTGC 
TCAGCGATGCTGGACACAAGGTCACCATCCTGGAGGCAGATAACAGGATCGGGGGCCGCATC 
TTCACCTACCGGGACCAGAACACGGGCTGGATTGGGGAGCTGGGAGCCATGCGCATGCCCAG 
CTCTCACAGGATCCTCCACAAGCTCTGCCAGGGCCTGGGGCTCAACCTGACCAAGTTCACCC 
AGTACGACAAGAACACGTGGACGGAGGTGCACGAAGTGAAGCTGCGCAACTATGTGGTGGAG 
AAGGTGCCCGAGAAGCTGGGCTACGCCTTGCGTCCCCAGGAAAAGGGCCACTCGCCCGAAGA 
CATCTACCAGATGGCTCTCAACCAGGCCCTCAAAGACCTCAAGGCACTGGGCTGCAGAAAGG 
CGATGAAGAAGTTTGAAAGGCACACGCTCTTGGAATATCTTCTCGGGGAGGGGAACCTGAGC 
CGGCCGGCCGTGCAGCTTCTGGGAGACGTGATGTCCGAGGATGGCTTCTTCTATCTCAGCTT 
CGCCGAGGCCCTCCGGGCCCACAGCTGCCTCAGCGACAGACTCCAGTACAGCCGCATCGTGG 
GTGGCTGGGACCTGCTGCCGCGCGCGCTGCTGAGCTCGCTGTCCGGGCTTGTGCTGTTGAAC 
GCGCCCGTGGTGGCGATGACCCAGGGACCGCACGATGTGCACGTGCAGATCGAGACCTCTCC 
CCCGGCGCGGAATCTGAAGGTGCTGAAGGCCGACGTGGTGCTGCTGACGGCGAGCGGACCGG 
CGGTGAAGCGCATCACCTTCTCGCCGCCGCTGCCCCGCCACATGCAGGAGGCGCTGCGGAGG 
CTGCACTACGTGCCGGCCACCAAGGTGTTCCTAAGCTTCCGCAGGCCCTTCTGGCGCGAGGA 
GCACATTGAAGGCGGCCACTCAAACACCGATCGCCCGTCGCGCATGATTTTCTACCCGCCGC 
CGCGCGAGGGCGCGCTGCTGCTGGCCTCGTACACGTGGTCGGACGCGGCGGCAGCGTTCGCC 
GGCTTGAGCCGGGAAGAGGCGTTGCGCTTGGCGCTCGACGACGTGGCGGCATTGCACGGGCC 
TGTCGTGCGCCAGCTCTGGGACGGCACCGGCGTCGTCAAGCGTTGGGCGGAGGACCAGCACA 
GCCAGGGTGGCTTTGTGGTACAGCCGCCGGCGCTCTGGCAAACCGAAAAGGATGACTGGACG 
GTCCCTTATGGCCGCATCTACTTTGCCGGCGAGCACACCGCCTACCCGCACGGCTGGGTGGA 
GACGGCGGTCAAGTCGGCGCTGCGCGCCGCCATCAAGATCAACAGCCGGAAGGGGCCTGCAT 
CGGACACGGCCAGCCCCGAGGGGCACGCATCTGACATGGAGGGGCAGGGGCATGTGCATGGG 
GTGGCCAGCAGCCCCTCGCATGACCTGGCAAAGGAAGAAGGCAGCCACCCTCCAGTCCAAGG 
CCAGTTATCTCTCCAAAACACGACCCACACGAGGACCTCGCATTAAAGTATTTTCGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MAPLALHLLVLVPILLSLVASQDWKAERSQDPFEKCMQDPDYEQLLKVVTWGLNRTLKPQRV 
I VVGAGVAGLVAAKVLSDAGHKVT I LE ADNR I GGR I FT YRDQNTGW I GELGAMRMP S S HR I L 
HKLCQGLGLNLTKFTQYDKNTWTEVHEVKLRNYVVEKVPEKLGYALRPQEKGHSPEDIYQMA 
LNQALKDLKALGCRKAMKKFERHTLLEYLLGEGNLSRPAVQLLGDVMSEDGFFYLSFAEALR 
AHS CL S DRLQ YSR I VGGWDLLPRALLS S LS GLVLLNAP WAMTQGPHDVHVQ I ETS P PARNL 
KVLKADVVLLTASGPAVKRITFSPPLPRHMQEALRRLHYVPATKVFLSFRRPFWREEHIEGG 
HSNTDRPSRMIFYPPPREGALLIiASYTWSDAAAAFAGLSREEALRLALDDVAALHGPVVRQL 
WDGTGWKRWAEDQHSQGGFWQPPALWQTEKDDWTVPYGRIYFAGEHTAYPHGWVETAVKS 
ALRAAIKINSRKGPASDTASPEGHASDMEGQGHVHGVASSPSHDLAKEEGSHPPVQGQLSLQ 
NTTHTRTSH 

Signal peptide: 

amino acids 1-21 



WO 00/12708 



PCT/US99/20111 



FIGURE 51 



CTGACATGGCCTGACTCGGGACAGCTCAGAGCAGGGCAGAACTGGGGACACTCTGGGCCGGCCTTCTG 

CCTGCATGGACGCTCTGAAGCCACCCTGTCTCTGGAGGAACCACGAGCGAGGGAAGAAGGACAGGGAC 

TCGTGTGGCAGGAAGAACTC AG AGCCGGGAAGC C CC CATT CACTAGAAGCACTGAGAGATG CGGCCCC 

CTCGCAGGGTCTGAATTTCCTGCTGCTGTTCACAAAGATGCTTTTTATCTTTAACTTTTTGTTTTCCC 

CACTTCCGACCCCGGCGTTGATCTGCATCCTGACATTTGGAGCTGCCATCTTCTTGTGGCTGATCACC 

AGACCTCAACCCGTCTTACCTCTTCTTGACCTGAACAATCAGTCTGTGGGAATTGAGGGAGGAGCACG 

GAAGGGGGTTTC CC AGAAGAACAATGACCTAACAAGTTGCTGCTT CTCAGATGCCAAGACTATGTATG 

AGGTTTTCCAAAGAGGACT CGCTGTGTCTGACAATGGGC CCTGCTTGGGATATAGAAAACCAAACCAG 

CCCTACAGATGGCTATCTTACAAACAGGTGTCTGATAGAGCAGAGTACCTGGGTTCCTGTCTCTTGCA 

TAAAGGTTATAAATCATCAC CAGAC CAGTTTGTCGGCATCTTTGCTCAGAATAGGCCAGAGTGGATCA 

TCTCCGAATTGGCTTGTTACACGTACTCTATGGTAGCTGTACCTCTGTATGACACCTTGGGACCAGAA 

GCCATCGTACATATTGTCAACAAGGCTGATATCGCCATGGTGATCTGTGACAC^CCCCAAAAGGCATT 

GGTGCTGATAGGGAATGTAGAGAAAGGCTTCACCCCGAGCCTGAAGGTGATC AT CCTT ATGGAC CC CT 

TTGATGATGACCTGAAGCAAAGAGGGGAGAAGAGTGGAATTGAGATCTTATCCCTATATGATGCTGAG 

AACCTAGGCAAAGAGCACTTCAGAAAACCTGTGCCTCCTAGCCCAGAAG ACCTGAG CGTC AT CTGCTT 

CACCAGTGGGACCACAGGTGACCCCAAAGGAGCCATGATAACCCATCAAAATATTGTTTCAAATGCTG 

CTGCCTTTCTCAAATGTGTGGAGCATGCTTATGAGCCCACTCCTGATGATGTGGCCATATCCTACCTC 

CCTCTGGCTCATATGTTTGAGAGGATTGTACAGGCTGTTGTGTACAGCTGTGGAGCCAGAGTTGGATT 

CTTCCAAGGGGATATTCGGTTGCTGGCTGACGACATGAAGACTTTGAAGCCCACATTGTTTCCCGCGG 

TG CCTCGACT CCTT AACAGGATCTACGATAAGGTACAAAATGAGGCCAAGACACCCTTGAAGAAGTT C 

TTGTTGAAGCTGGCTGTTTCCAGTAAATTCAAAGAGCTTCAAAAGGGTATCATCAGGCATGATAGTTT 

CTGGGACAAGCTCATCTTTGCAAAGATCCAGGACAGCCTGGGCGGAAGGGTTCGTGTAATTGTCACTG 

GAGCTGCCCCCATGTCCACTTCAGTCATGACATTCTTCCGGGCAGCAATGGGATGTCAGGTGTATGAA 

GCTTATGGTC?U^CAGAATGCACAGGTGGCTGTACATTTACATTACCTGGGGACTGGACATCA^^ 

CGTTGGGGTGCCCCTGGCTTGCAATTACGTGAAGCTGGAAGATGTGGCTGACATGAACTACTTTACAG 

TGAATAATGAAGGAGAGGTCTGCATCAAGGGTACAAACGTGTTCAAAGGATACCTGAAGGACCCTGAG 

AAGACACAGGAAGC CCTGGACAGTGATGGCTGGCTTCACACAGGAGACATTGGTCGCTGGCTC CCGAA 

TGGAACTCTGAAGATCATCGACCGTAAAAAGAACATTTTCAAGCTGGCCCAAGGAGAATACATTGCAC 

CAGAGAAGATAGAAAATATCTACT^CAGGAGTCAACCAGTGTTACAAATTTTTGTACACGGGGAGAGC 

TTACGGT CATCCTTAGT AGGAGTGGTGGTTCCTGACACAGATGTACTTCC CTCATTTGCAGCCAAGCT 

TGGGGTGAAGGG CT CCTTTGAGGAACTGTGCCAAAACCAAGTTGTAAGGGAAGCCATTTTAGAAGACT 

TGCAGAAAATTGGGAAAGAAAGTGGCCTTAAAACTTTTGAACT^GGTCAAAGCCATTTTTCTTCATCCA 

GAGCCATTTTCC^TTGAAAATGGGCTCTTGACACC^C^TTGAAAGC^AAGCGAGGAGAGCTTTCCAA 

ATACTTTCGGACCCAAATTGACAGCCTGTATGAGCACATCCAGGATTAGGATAAGGTACTTAAGTACC 

TGCCGGCCCACTGTGCACTGCTTGTGAGAAAATGGATTAAAAACTATTCTTACATTTGTTTTGCCTTT 

CCTCCTATTTTTTTTTAACCTGTTAAACTCTAAAGCCATAGCTTTTGTTTTATATTGAGACATATAAT 

GTGTAAACTTAGTTCC<^U^TAAATCAATCCTGTCTTTCCCATCTTCGATGTTGCTAATATTAAGGCT 

TCAGGGCTACTTTTATCAACATGCCTGTCTTCAAGATCCCAGTTTATGTTCTGTGTCCTTCCTCATGA 

TTTCCAACCTTAATACTATTAGTAACCAC^GTTCAAGGGTCAAAGGGAC CCTCTGTGCCTT CTTCTT 

TGTTTTGTGATAAACATAACTTGCCAACAGTCTCTATGCTTATTTACATCTTCTACTGTTCAAACTAA 

GAGATTTTTAAATT CTGAAAAACTGCTTACAATTCATGTTTT CTAGCCACTC CACAAACCACTAAAAT 

TTT AGTTTT AGCCTATC ACTC ATGTCAAT CATATCTATGAGACAAATGTCT CCGATGCTCTTCTGCGT 

AAATTAAATTGTGTACTGAAGGGAAAAGTTTGAT CATACCAAACATTT CCTAAACTCTCTAGTTAGAT 

ATCTGACTTGGGAGTATTAAAAATTGGGTCTATGACATACTGTCCAAAAGGAATGCTGTTCTTAAAGC 

ATTATTTACAGTAGGAACTGGGGAGTAAATCTGTTCCCTACAGTTTGCTGCTGAGCTGGAAGCTGTGG 

GGGAAGGAGTTGACAGGTGGGCCCAGTGAACTTTTCCAGTAAATGAAGCT^GCACTGAATAAAAACCT 

CCTGAACTGGGAACAAAGATCT ACAGGCAAGCAAGATGCC CACACAACAGG CTTATTTTCTGTGAAGG 

AACCAACTGATCTCCCCCACCCTTGGATTAGAGTTCCTGCTCTACCTTACCCACAGATAACACATGTT 

GTTTCTACTTGTAAATGTAAAGTCTTTAAAATAAACTATTACAGATAAAAAA 
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FIGURE 52 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA60775 
<subunit 1 of 1, 739 aa, 1 stop 
<MW: 82263, pi: 7.55, NX(S/T): 3 

MDALKPP CLWRNHERGKKDRDS CGRKNS E PGS PH S LEALRDAAP SQGLNFLLLFTKMLF I FN 
FLFSPLPTPALICILTFGAAIFLWLITRPQPVLPLLDLNNQSVGIEGGARKGVSQKNNDLTS 
CCFSDAKTMYEVFQRGLAVSDNGPCLGYRKPNQPYRWLSYKQVSDRAEYLGSCLLHKGYKSS 
PDQFVGIFAQNRPEWIISELACYTYSMVAVPLYDTLGPEAIVHIVNKADIAMVICDTPQKAL 
VLIGNVEKGFTPSLKVIILMDPFDDDLKQRGEKSGIEILSLYDAENLGKEHFRKPVPPSPED 
LSVICFTSGTTGDPKGAMITHQNIVSNAAAFLKCVEHAYEPTPDDVAISYLPLAHMFERIVQ 
AVVYSCGARVGFFQGDIRLIiADDMKTLKPTLFPAVPRLLNRIYDKVQNEAKTPLKKFLLKLA 
VSSKFKELQKGIIRHDSFWDKLIFAKIQDSLGGRVRVIVTGAAPMSTSVMTFFRAAMGCQVY 
EAYGQTECTGGCTFTLPGDWTSGHVGVPLACNYVKLiEDVADMNYFTVNNEGEVC I KGTNVFK 
GYLKDPEKTQEAIiDSDGWLHTGD I GRWLPNGTLK 1 1 DRKKN I FKLAQGE Y I APEKI ENIYNR 
SQPVLQIFVHGESLRSSLVGVWPDTDVLPSFAAKLGVKGSFEELCQNQWREAILEDLQKI 
GKESGLKTFEQVKAIFLHPEPFSIENGLLTPTLKAKRGELSKYFRTQIDSLYEHIQD 

Important features: 

Type II transmembrane domain: 

amino acids 61-80 

Putative AMP -binding domain signature. 

amino acids 314-325 

N-glycosylation site. 

amino acids 102-105, 588-591 and 619-622 



• 
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FTGT7RE 53 



GGAGGCGGAGGCCGCGGCGAGCCGGGCCGAGCAGTGAGGGCCCTAGCGGGGCCCGAGCGGGG 

CCCGGGGCCCCTAAGCCATTCCTGAAGTCATGGGCTGGCCAGGACATTGGTGACCCGCCAAT 

CCGGTATGGACGACTGGAAGCCCAGCCCCCTCATCAAGCCCTTTGGGGCTCGGAAGAAGCGG 

AGCTGGTACCTTACCTGGAAGTATAAACTGACAAACCAGCGGGCCCTGCGGAGATTCTGTCA 

GACAGGGGCCGTGCTTTTCCTGCTGGTGACTGTCATTGTCAATATCAAGTTGATCCTGGACA 

CT CGGCGAG C CAT CAGTG AAGCC AATGAAGAC CCAG AG CC AGAG CAAG ACTATG ATGAGGCC 

CTAGGCCGCCTGGAGCCCCCACGGCGCAGAGGCAGTGGTCCCCGGCGGGTCCTGGACGTAGA 

GGTGTATTCAAGTCGC AG CAAAGTATATGTGG CAGTGGATGGCACC ACGGTGCTGGAGGATG 

AGGCCCGGGAGCAGGGCCGGGGCATCCATGTCATTGTCCTCAACCAGGCCACGGGCCACGTG 

ATGGCAAAACGTGTGTTTGACACGTACTCACCTCATGAGGATGAGGCCATGGTGCTATTCCT 

CAACATGGTAGCGCCCGGCCGAGTGCTCATCTGCACTGTCAAGGATGAGGGCTCCTTCCACC 

TCAAGGACACAGCCAAGGCTCTGCTGAGGAGCCTGGGCAGCCAGGCTGGCCCTGCCCTGGGC 

TGGAGGGACACATGGGCCTTCGTGGGACGAAAAGGAGGTCCTGTCTTCGGGGAGAAACATTC 

TAAGTCACCTGCCCTCTCTTCCTGGGGGGACCCAGTCCTGCTGAAGACAGATGTGCCATTGA 

GCTCAGCAGAAGAGGCAGAGTGCCACTGGGCAGACACAGAGCTGAACCGTCGCCGCCGGCGC 

TT CTGC AG CAAAGTTGAGGGCTATGGAAGTGTATG CAG CTGC AAGG ACCC CACAC C CATCGA 

GTTC AG C C CTGAC C CACT CC CAGACAACAAGGT C CT CAATGTGC CTGTGG CTGT CATTGCAG 

GGAACCGACCCAATTACCTGTACAGGATGCTGCGCTCTCTGCTTTCAGCCCAGGGGGTGTCT 

CCTCAGATGATAACAGTTTTCATTGACGGCTACTATGAGGAACCCATGGATGTGGTGGCACT 

GTTTGGTCTGAGGGGCAT C CAG CATACT C C CATC AGCATC AAGAATG C CCGCGT GT CT CAGC 

ACTACAAGGCCAGCCTCACTGCCACTTTCAACCTGTTTCCGGAGGCCAAGTTTGCTGTGGTT 

CTGGAAGAGGACCTGGACATTGCTGTGGATTTTTTCAGTTTCCTGAGCCAATCCATCCACCT 

ACTGGAGGAGGATGAC AG CC TGTACTGC AT CT CTGC CTGGAATG AC CAGGGGTATGAACACA 

CGGCTGAGGACCCAGCACTACTGTACCGTGTGGAGACCATGCCTGGGCTGGGCTGGGTGCTC 

AGGAGGTCCTTGTACAAGGAGGAGCTTGAGCCCAAGTGGCCTACACCGGAAAAGCTCTGGGA 

TTGGGACATGTGGATGCGGATGCCTGAACAACGCCGGGGCCGAGAGTGCATCATCCCTGACG 

TTTCCCGATCCTACCACTTTGGCATCGTCGGCCTCAACATGAATGGCTACTTTCACGAGGCC 

TACTTCAAGAAGCACAAGTTCAACACGGTTCCAGGTGTCCAGCTCAGGAATGTGGACAGTCT 

GAAGAAAGAAGCTTATGAAGTGGAAGTTCACAGGCTGCTCAGTGAGGCTGAGGTTCTGGACC 

ACAGCAAG AACC CTTGTGAAGACTCTTTCCTGCCAGACACAGAGGGCCACAC CTACGTGGCC 

TTTATT CGAATGGAGAAAGATGATGACTTCAC CACCTGGACC C AGCTT GCCAAGTGCCTC C A 

TATCTGGGACCTGGATGTGCGTGGCAACCATCGGGGCCTGTGGAGATTGTTTCGGAAGAAGA 

ACCACTTCCTGGTGGTGGGGGTCCCGGCTTCCCCCTACTCAGTGAAGAAGCCACCCTCAGTC 

ACCCCAATTTTCCTGGAGCCACCCCCAAAGGAGGAGGGAGCCCCAGGAGCCCCAGAACAGAC 

ATGAGACCTCCTCCAGGACCCTGCGGGGCTGGGTACTGTGTACCCCCAGGCTGGCTAGCCCT 

TCCCTCCATCCTGTAGGATTTTGTAGATGCTGGTAGGGGCTGGGGCTACCTTGTTTTTAACA 

TGAGACTTAATTACTAACTCCAAGGGGAGGGTTC CC CTGCTCCAACACCCCGTT CCTGAGTT 

AAAAGTCTATTTATTTACTTCCTTGTTGGAGAAGGGCAGGAGAGTACCTGGGAATCATTACG 

ATCCCTAGCAGCTCATCCTGCCCTTTGAATACCCTCACTTTCCAGGCCTGGCTCAGAATCTA 

ACCTATTTATTGACTGTCCTGAGGGCCTTGAAAACAGGCCGAACCTGGAGGGCCTGGATTTC 

TTTTTGGGCTGGAATGCTGCCCTGAGGGTGGGGCTGGCTCTTACTCAGGAAACTGCTGTG.ee 

CAACCCATGGACAGGCCCAGCTGGGGCCCACATGCTGACACAGACTCACTCAGAGACCCTTA 

GACACTGGACCAGGCCTCCTCTCAGCCTTCTCTTTGTCCAGATTTCCAAAGCTGGATAAGTT 

GGTC ATTGATTAAAAAAGGAGAAGCCCT CTGGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA61185 
xsubunit 1 of 1, 660 aa, 1 stop 
><MW: 75220, pi: 6.76, NX(S/T) : 0 

MDDWKPSPLIKPFGARKKRSWYLTWKYKLTNQRALRRFCQTGAVLFLLVTVIVNIKLILDTR 
RAISEANEDPEPEQDYDEALGRLEPPRRRGSGPRRVLDVEVYSSRSKVYVAVDGTTVLEDEA 
REQGRGIHVIVLNQATGHVMAKRVFDTYSPHEDEAMVLFLNIWAPGRVLICTVKDEGSFHL^ 
DTAKALLRSLGSQAGPALGWRDTWAFVGRKGGPVFGEKHSKSPALSSWGDPVLLKTDVPLSS 
AEE AECHWADTELNRRRRRF CS KVEGYGS VCS CKD PTP I EFS PD PL PDNKVLNVP VAV I AGN 
RPNYLYRMLRSLLSAQGVSPQMITVFIDGYYEEPMDWALFGLRGIQHTP IS IKNARVSQHY 
KASLTATFNLFPEAKFAWLEEDIiDIAVDFFSFLSQSIHLLEEDDSLYCISAWNDQGYEHTA 
EDPALLYRVETMPGLGVTVLRRSLYKEELEPKWPTPEKLWDWDMWMRMPEQRRGRECII PDVS 
RSYHFGIVGLNMNGYFHEAYFKKHKFNTVPGVQLRNV^ 

KNPCEDSFLPDTEGHTYVAFIRMEKDDDFTTWTQLAKCLHIWDLDVRGNHRGLjWRLFRKKNH 
FLWGVPAS PYSVKKPPS VTP I FLEPPPKEEGAPGAPEQT 

Important features of the protein: 
Transmembrane domain: 

amino acids 3 8-55 

Homologous region to Mouse GNT1 

amino acids 229-660 



try /*7o 




• 
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CGGACGCGTGGGCTGCTGGTGGGAAGGCCTAAAGAACTGGAAAGCCCACTCTCTTGGAACCACCACAC 
CTGTTTAAAGAACCTAAGCACCATTTAAAGCCACTGGAAATTTGTTGTCTAGTGGTTGTGGGTGAATA 
AAGGAGGGCAGAATGGATGATTTCATCTCCATTAGCCTGCTGTCTCTGGCTATGTTGGTGGGATGTTA 
CGTGG C CGGAATC AT TC C CTTGG CTGTTAATTT CTCAGAGGAACGACTGAAGCTGGTGACTGTTTTGG 
GTGCTGGCCTTCTCTGTGGAACTGCTCTGGCAGTCATCGTGCCTGAAGGAGTACATGCCCTTTATGAA 
GATATT CTTGAGGGAAAACACC ACC AAGC AAGTG AAACAC ATAATGTGATTG CATCAGACAAAGCAG C 
AGAAAAATCAGTTGTCCATGAACATGAGCACAGCCACGACCACACACAGCTGCATGCCTATATTGGTG 
TTTCCCTCGTTCTGGGCTTCGTTTTCATGTTGCTGGTGGACCAGATTGGTAACTCCCATGTGCATTCT 
ACTGACGATCCAGAAGCAGCAAGGTCTAGCAATTCCAAAATCACCACCACGCTGGGTCTGGTTGTCCA 
TG CTGCAGCTGATGGTGTTGCTTTGGGAGCAGCAGCATCTACTTCACAGACCAGTGT CC AGTTAATTG 
TGTTTGTGGCAATCATGCTACATAAGGCACCAGCTGCTTTTGGACTGGTTTCCTTCTTGATGCATGCT 
GGCTTAGAGCGGAATCGAATCAGAAAGCACTTGCTGGTCTTTGCATTGGCAGCACCAGTTATGTCCAT 
GGTGACAT ACTTAGGACTGAGTAAGAGCAGTAAAGAAGC C CTTTCAG AGGTGAACGC CACGGGAGTGG 
CCATGCTTTTCTCTGCCGGGACATTTCTTTATGTTGCCACAGTACATGTCCTCCCTGAGGTGGGCGGA 
ATAGGGCACAGCCACAAGCCCGATGCCACGGGAGGGAGAGGCCTCAGCCGCCTGGAAGTGGCAGCCCT 
GGTTCTGGGTTGCCTCATCCCTCTCATCCTGTCAGTAGGACACCAGCATTAAATGTTCAAGGTCCAGC 
CTTGGTCCAGGGCCGTTTGCCATCCAGTGAGAACAGCCGGCACGTGACAGCTACTCACTTCCTCAGTC 
TCTTGT CT CACCTTGCG CATCTCTACATGTATTCCTAGAGTCCAGAGGGGAGGTGAGGTTAAAACCTG 
AGTAATGGAAAAGCTTTTAGAGTAGAAACACATTTACGTTGCAGTTAGCTATAGACATCCCATTGTGT 
TATCTTTTAAAAGGCCCTTGACATTTTGCGTTTTAATATTTCTCTTAACCCTATTCTCAGGGAAGATG 
GAATTTAGTTTTAAGGAAAAGAGGAGAACTTCATACTCACAATGAAATAGTGATTATGAAAATACAGT 
GTTCTGTAATTAAGCTATGTCTCTTTOTTCTTAGTTTAGAGGCTCTGCTACTTTATCC^TTGATTTTT 
AACATGGTTCCCACCATGTAAGACTGGTGCTTTAGCATCTATGCCACATGCGTTGATGGAAGGTCATA 
GCACCCACTCACTTAGATGCTAAAGGTGATTCTAGTTAATCTGGGATTAGGGTCAGGAAAATGATAGC 
AAGACACATTGAAAGCTCTCTTTATACTCAAAAGAGATATCCATTGAAAAGGGATGTCTAGAGGGATT 
TAAACAGCTCCTTTGGCACGTGCCTCTCTGAATCCAGCCTGCCATTCCATCAAATGGAGCAGGAGAGG 
TGGGAGGAGCTTCTAAAGAGGTGACTGGTATTTTGTAGCATTCCTTGTCAAGTTCTCCTTTGCAGAAT 
ACCTGTCTCCACATTCCTAGAGAGGAGCCAAGTTCTAGTAGTTTCAGTTCTAGGCTTTCCTTCAAGAA 
CAGTCAGATCACAAAGTGTCTTTGGAAATTAAGGGATATTAAATTTTAAGTGATTTTTGGATGGTTAT 

TTTTTTTTTAATTATTTCTCTTAGCAGATCAGCAATCCCTCTAGGGACCTAAATACTAGGTCAGCTTT 
GGCGACACTGTGTCTTCTCACATAACCACCTGTAGCAAGATGGATCATAAATGAGAAGTGTTTGCCTA 
TTGATTTAAAGCTTATTGGAATCATGTCTCTTGTCTCTTCGTCTTTTCTTTGCTTTTCTTCTAACTTT 
TCCCTCTAGCCTCTCCTCGCCACAATTTGCTGCTTACTGCTGGTGTTAATATTTGTGTGGGATGAATT 
CTTATCAGGACAACCACTTCTCGAACTGTAATAATGAAGATAATAATATCTTTATTCTTTATCCCCTT 
CAAAGAAATTACCTTTGTGTCAAATGCCGCTTTGTTGAGCCCTTAAAATACCACCTCCTCATGTGTAA 
ATTGACACAATCACTAATCTGGTAATTTAAACAATTGAGATAGCAAAAGTGTTTAACAGACTAGGATA 
ATTTTTTTTTCATATTTGCCAAAATTTTTGTAAACCCTGTCTTGTCAAATAAGTGTATAATATTGTAT 
TATTAATTTATTTTTACTTTCTATACCATTTCAAAACACATTACACTAAGGGGGAACCAAGACTAGrT 
TCTTCAGGGCAGTGGACGTAGTAGTTTGTAAAAACGTTTTCTATGACGCATAAGCTAGCATGCCTATG 
ATTTATTTCCTTCATGAATTTGTCACTGGATCAGCAGCTGTGGAAATAAAGCTTGTGAGCCCTCTGCT 
GGCCACAGTGAGGAAAGTAGCACAAATAGGATACAGTTGTATGTAGTCATTGGCAACAATTGCATACA 
ATTTTACTACCAAGAGAAGGTATAGTATGGAAAGTCCAAATGACTTCCrTGATTGGATGTTAACAGCT 
GACTGGTGTGAGACTTGAGGTTTCATCTAGTCCTTCAAAACTATATGGTTGCCTAGATTCTCTCTGGA 
AACTGACTTTGTCAAATAAATAGCAGATTGTAGTGTCAAAAAAAA 
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MDDF IS I SLLSLAMLVGCYVAGI I PLAVNFSEERLKLVTVLGAGLLCGTALAVI VPEGVHAL 
YEDILEGKHHQASETHNVIASDKAAEKSWHEHEHSHDHTQLHAYIGVSLVLGFVFMLLVDQ 
IGNSHVHSTDDPEAARSSNSKITTTLGLWHAAADGVALGAAASTSQTSVQLIVFVAIMLHK 
APAAFGLVS FLMHAGLERNR I RKHLLVF ALAAPVMSMVT YLGL S KS SKEALS EVNATGVAML 
FS AGTFLYVATVHVLPEVGG I GHSHKPDATGGRGLSRLE VAALVLGCL I PLI LS VGHQH 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 37-56, 106-122, 211-230, 240-260, 288-304 
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GCTCGAGGCCGGCGGCGGCGGGAGAGCGACCCGGGCGGCCTCGTAGCGGGGCCCCGGATCCC 
CGAGTGGCGGCCGGAGCCTCGAAAAGAGATTCTCAGCGCTGATTTTGAGATCATGGGCTTGG 
GAAACGGGCGTCGCAGCATGAAGTCGCCGCCCCTCGTGCTGGCCGCCCTGGTGGCCTGCATC 
ATCGTCTTGGGCTTCAACTACTGGATTGCGAGCTCCCGGAGCGTGGACCTCCAGACACGGAT 
CATGGAGCTGGAAGGCAGGGTCCGCAGGGCGGCTGCAGAGAGAGGCGCCGTGGAGCTGAAGA 
AGAACGAGTTC C AGGG AGAG CTGG AGAAGC AGCGGG AGC AGC TTGACAAAATCCAGT CCAGC 
CACAACTTCCAGCTGGAGAGCGTCAACAAGCTGTACCAGGACGAAAAGGCGGTTTTGGTGAA 
TAACAT C ACC AC AGGTGAGAGG CT CATC CGAGTGCTG CAAGACC AGTTAAAG ACCCTGCAGA 
GG AATT ACGGCAGG CTGCAG CAGG ATGT C CT C C AGTTT CAGAAGAAC CAGACC AAC CTGGAG 
AGGAAGTT CT CCT ACG AC CTG AGC CAGTG CATC AATCAGATGAAGGAGGTGAAGGAAC AGTG 
TG AGGAGCGAAT AG AAG AGGTCACC AAAAAGGGGAATGAAG CTGTAGCTT C CAG AG AC CTGA 
GTGAAAAC AACGAC CAGAGACAGCAGCTCCAAGC CCT CAGTG AG C CTCAGCC CAGG CTGCAG 
GCAGCAGGCCTGCCACACACAGAGGTGCCACAAGGGAAGGGAAACGTGCTTGGTAACAGCAA 
GT C CCAGACACCAGC CC CCAGTTC CGAAGTGGTTTTGG ATTC AAAG AG AC AAGTTG AGAAAG 
AGGAAACCAATGAGATC CAGGTGGTGAATGAGGAGC CT CAGAGGGACAGGCTGC CG CAGGAG 
C CAGG CCGGGAG CAGGTGGTGGAAGACAGAC CTGTAGGTGGAAGAGGCTT CGGGGGAGCCGG 
AGAACTGGGCCAGACCCCACAGGTGCAGGCTGCCCTGTCAGTGAGCCAGGAAAATCCAGAGA 
TGGAGGGC CCTG AG CGAGAC CAGCTTGT CATC C C CGACGGACAGGAGGAGGAGCAGGAAGCT 
GC CGGGGAAGGGAG AAAC CAGCAG AAACTG AGAGGAGAAGATGACTACAACATGGATG AAAA 
TGAAGCAGAATCTGAGACAGACAAGCAAGCAGCCCTGGCAGGGAATGACAGAAACATAGATG 
TTTTTAATGTTGAAGATCAGAAAAGAGACACCATAAATTTACTTGATCAGCGTGAAAAGCGG 
AATC ATAC ACT CTQAATTG AACTGGAATCACATATTTCACAACAGGGC CG AAGAGATGACTA 
TAAAATGTTC^TGAGGGACTGAATACTGAAAACTGTGA 
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MMGLGNGRRSMKSPPLVLAALVACI I VLGFNYWI AS SRSVDLQTRIMELEGRVRRAAAERGA 
VELKKNEFQGELEKQREQLDKIQSSHNFQLESVNKLYQDEKAVLVNNITTGERLIRVLQDQL 
KTLQRNYGRLQQDVLQ FQ KNQTNLERKF S YDLSQ C I NQMKE VKEQC EER I EE VTKKGNEAVA 
SRDLSENNDQRQQLQALSEPQPRLQAAGLPHTEVPQGKGNVLGNSKSQTPAPSSEWLDSKR 
QVEKEETNEIQWNEEPQRDRLPQEPGREQWEDRPVGGRGFGGAGELGQTPQVQAALSVSQ 
ENPEMEGPERDQLVIPDGQEEEQEAAGEGRNQQKLRGEDDYNMDENEAESETDKQAALAGND 
RN I D VFNVEDQ KRDT I NLLDQRE KRNHTL 

Signal peptide: 

amino ac ids 1-29 
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GGATGCAGAAAGCCTCAGTGTTGCTCTTCCTGGCCTGGGTCTGCTTCCTCTTCTACGCTGGCATTGCCCTCTTCA 
CCAGTGGCTTCCTGCTCACCCGTTTGGAGCTCACCAACCATAGCAGCTGCCAAGAGCCCCCAGGCCCTGGGTCCC 
TGCCATGGGGGAGCCAAGGGAAACCTGGGGCCTGCTGGATGGCTTCCCGATTTTCGCGGGTTGTGTTGGTGCTGA 
TAGATGCTCTGCGATTTGACTTCGCCCAGCCCCAGCATTCACACGTGCCTAGAGAGCCTCCTGTCTCCCTACCCT 
TCCTGGGCAAACTAAGCTCCTTGCAGAGGATCCTGGAGATTCAGCCCCACCATGCCCGGCTCTACCGATCTCAGG 

ttgaccctcctaccaccaccatgcagcgcctcaaggccctcaccactggctcactgcctacctttattgatgctg 
gtagtaacttcgccagccacgccatagtggaagacaatctcattaagcagctcaccagtgcaggaaggcgtgtag 
tcttcatgggagatgatacctggaaagaccttttccctggtgctttctccaaagctttcttcttcccatccttca 
atgtc^gagacctagacacagtggacaatggcatcctggaacacctctaccccaccatggacagtggtgaatggg 
acgtgctgattgctcacttcctgggtgtggaccactgtggccacaagcatggccctcaccaccctgaaatggcca 
agaaacttagccagatggaccaggtgatccagggacttgtggagcgtctggagaatgacacactgctggtagtgg 
ctggggaccatgggatgaccacaaatggagaccatggaggggacagtgagctggaggtctcagctgctctctttc 
tgtatagccccacagcagtcttccccagcaccccaccagaggagccagaggtgattcctcaagttagccttgtgc 
ccacgctggccctgctgctgggcctgcccatcccatttgggaatatcggggaagtgatggctgagctattctcag 
ggggtgaggactcccagccccactcctctgctttagcccaagcctcagctctccatctcaatgctcagcaggtgt 
cccgatttcttcatacctactcagctgctactcaggaccttcaagctaaggagcttcatcagctgcagaacctct 
tctccaaggcctctgctgactaccagtggcttctccagagccccaagggggctgaggcgacactgccgactgtga 
ttgctgagctgcagcagttcctgcggggagctcgggccatgtgcatcgagtcttgggctcgtttctctctggtcc 
gcatggcggggggtactgctctcttggctgcttcctgctttatctgcctgctggcatctcagtgggcaatatccc 
caggctttccattctgccctctactcctgacacctgtggcctggggcctggttggggccatagcgtatgctggac 
tcctgggaactattgagctgaagctagatctagtgcttctaggggctgtggctgcagtgagctcattcctccctt 
ttctgtggaaagcctgggctggctgggggtccaagaggcccctggcaaccctgtttcccatccctgggcccgtcc 
tgttactcctgctgtttcgcttggctgtgttcrrtctctgatagttttgttgtagctgaggccagggccaccccct 
tccttttgggctcattcatc ctgc tcctggttgtccagcttcactggg aggg c cagctgc ttc cac ctaagctac 
tcacaatgccccgccttggcacttcagccacaacaaaccccccacggcacaatggtgcatatgccctgaggcttg 
gaattgggttgcttttatgtacaaggctagctgggctttttcatcgttgccctgaagagacacctgtttgccact 
cctctccctggctgagtcctctggcatccatggtgggtggtcgagccaagaatttatggtatggagcttgtgtgg 
cggcgctggtggccctgttagctgccgtgcgcttgtggcttcgccgctatggtaatctcaagagccccgagccac 
ccatgctctttgtgcgctggggactgcccctaatggcattgggtactgctgcctactgggcattggcgtcggggg 
cagatgaggctcccccccgtctccgggtcctggtctctggggcatccatggtgctgcctcgggctgtagcagggc 
tggctgcttcagggctcgcgctgctgctctggaagcctgtgacagtgctggtgaaggctggggcaggcgctccaa 
ggaccaggactgtcctcactcccttctcaggcccccccacttctcaagctgacttggattatgtggtccctcaaa 
tctaccgacacatgcaggaggagttccggggccggttagagaggaccaaatctcagggtcccctgactgtggctg 
cttatcagttggggagtgtctactcagctgctatggtcacagccctcaccctgttggccttcccacttctgctgt 
tgcatgcggagcgcatcagccttgtgttcctgcttctgtttctgcagagcttccttctcctacatctgcttgctg 
ctgggatacccgtcaccacccctggtccttttactgtgccatggcaggcagtctcggcttgggccctcatggcca 
cacagaccttctactccacaggccaccagcctgtctttccagccatccattggcatgcagccttcgtgggattcc 
cagagggtcatggctcctgtacttggctgcctgctttgctagtgggagccaacacctttgcctcccacctcctct 
ttgcagtaggttgcccactgctcctgctctggcctttcctgtgtgagagtcaagggctgcggaagagacagcagc 
ccccagggaatgaagctgatgccagagtcagacccgaggaggaagaggagccactgatggagatgcggctccggg 
atgcgcctcagcacttctatgcagcactgctgcagctgggcctcaagtacctctttatccttggtattcagattc 
tggcctgtgccttggcagcctccatccttcgcaggcatctcatggtctggaaagtgtttgcccctaagttcatat 
ttgaggctgtgggcttcattgtgagcagcgtgggacttctcctgggcatagctttggtgatgagagtggatggtg 
ctgtgagctcctggttgaggcagctatttctggcccy^ 

acagagagtgctggagaacagtgtagcctggcctgtacaggtactggatgatctgcaagacaggctcagccatac 
tcttactatcatgcagccaggggccgctgacatctaggacttcattattctataattcaggaccacagtggagta 
tgatccctaactcctgatttggatgcatctgagggacaaggggggcggtctccgaagtggaataaaataggccgg 
gcgtggtgacttgcacctataatcccagcactttgggaggcagaggtgggaggattgcttggtcccaggagttca 
agaccagcctgtggaacataacaagaccccgtctctactatttaaaaaaaagtgtaa:ta;^aatgataatat 



WO 00/12708 



PCT/US99/2011I 



FIGURE 60 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA62 8 09 
<subunit 1 of 1, 1089 aa, 1 stop 
<MW: 118699, pi: 8.49, NX(S/T): 2 

MQKAS VLLFLAWVCFL F YAGI AL FTSGFLLTRLELTNHS S CQE PPGPGS LPWGSQGKPGACW 
MASRFSRWLVLIDALRFDFAQPQHSHVPREPPVSLPFLGKLSSLQRILEIQPHHARLYRSQ 
VDPPTTTMQRLKALTTGSLPTF I DAGSNFASHAI VEDNL I KQLTS AGRR WFMGDDTWKDLF 
PGAFSKAFFFPSFNVRDLDTVDNGILEHLYPTMDSGEWDVLIAHFLGVDHCGHKHGPHHPEM 
AKKLSQMDQVIQGLVERLENDTLLWAGDHGMTTNGDHGGDSELEVSAALFLYSPTAVFPST 
PPEEPEVI PQVSLVPTLALLLGLPIPFGNIGEVMAELFSGGEDSQPHSSALAQASALHLNAQ 
QVSRFLHTYSAATQDLQAKELHQLQNLFSKASADYQWLLQSPKGAEATLPTVIAELQQFLRG 
ARAMCIESWARFSLVRMAGGTALLAASCFICLLASQWAISPGFPFCPLLLTPVAWGLVGAIA 
YAGLLGT I ELKLDLVLLGAVAAVSS FL PFLWKAWAGWGS KRPLATLFP I PGPVLLLLLFRLA 
VFFSDSFWAEARATPFLLGSFILLLWQLHWEGQLLPPKLLTMPRLGTSATTNPPRHNGAY 
ALRLG I GLLL CTRLAGLFHRCPEETPVCHSS PWLS PLASMVGGRAKNLW YGAC VAAL VALLA 
AVR.LWLRRYGNLKSPEPPMLFVRWGLPLMALGTAAYWALASGADEAPPRLRVLVSGASMVLP 
RAVAGLAASGIJVLLLWKPVTVLVKAGAGAPRTRTVLTPFSGPPTSQADLDYWPQIYRHMQE 
EFRGRLERTKSQGPLTVAAYQLGSVYSAAMVTALTLLAFPLLLLHAER I SLVFLLLFLQS FL 
LLHLLAAGIPVTTPGPFTVPWQAVSAWALMATQTFYSTGHQPVFPAIHWHAAFVGFPEGHGS 
CTWLPALLVGANTFASHLLFAVGCPLLLLWPFLCESQGLRKRQQPPGNEADARVRPEEEEEP 
LMEMRLRDAPQHFYAALLQLGLKYLF ILGIQILACALAAS ILRRHLMVWKVFAPKF I FEAVG 
F I VSSVGLLLGI ALVMRVDGAVSSWFRQLFLAQQR 



Important features: 
Signal peptide: 

amino acids 1-16 



Transmembrane domains: 

amino acids 317-341, 451-470, 481-500, 510-527, 538-555, 831-850, 
1016-1034, 1052-1070 



Leucine zipper pattern. 

amino acids 843-864 



N-glycosylation sites . 

amino acids 37-40, 268-271 
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TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGT 
GTCTCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGT 
CCTACAATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATT 
TCAT C ACAAATTGG CC CACCAG AGGTGG CACTGACT AC AG ATGAGAAG TC CATTT CTGTTGT 
C CTGACAGCT CCAGAG AAGTGGAAGAG AAAT C CAGAAG AC CTTC CTGTTT C CATGCAACAAA 
TATACTCCAATCTGAAGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCC 
CAGTGTGTGACCAACCACACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGT 
ACACGTGGAGTCCTTCGTCCCAGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTG 
CCAGGACTTTGAAAGATCAATCATCAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTG 
CCCATATCTATTACCGTGTTTCTTTTTTCTGTGATGGGCTATTCCATCTACCGATATATCCA 
CGTTGG CAAAGAGAAACAC C CAG CAAATTTGATTTTGATTTATGGAAATG AATTTGAC AAAA 
GATT CTTTGTGC CTGCTGAAAAAATCGTGATTAACTTT ATCACC CTCAATAT CTCGGATGAT 
TCTAAAATTTCT CATCAGGATATGAGTTTACTGGGAAAAAG CAG TGATGTAT CC AGCCTT AA 
TGAT CCT C AG CCC AGCGGGAACCTGAGG C C CC CT CAGGAGGAAGAGG AGG TG AAACATTT AG 
GGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAGAAAACACGGAAGGTACTTCT 
CTCACCCAGCAAGAGT CC CTCAGCAGAACAATACCCCCGGAT AAAACAGTCATTGAATATGA 
ATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGAGCTCAGTTTGCAGG 
AGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAGTCTTGGGCCCG 
CAAACGTTAC AGTACT CATACACC CCTC AGCT CC AAGACTTAGAC CCC CTGG CGCAGGAG C A 
CACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGATCCCC 
AAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCGAG 
CCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGC 
T C CAGACAGG CCACC AGGAGAAAATGAAACCT AT CTCATGCAATTCATGGAGGAAT GGGGGT 
TATATGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAAC 
AAGTGAGTCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGT 
TTGTCAGTGTCTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTG 
GTTCATGCATGTAGGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGT 
TGTT CTATG CAGAG AAAG CAGT C AAT AAATGTTTGC CAGACTGGGTGC AG AATTTATTCAGG 
TGGGTGT 



• 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA62815 
oubunit 1 of 1, 442 aa, 1 stop 
<MW: 49932, pi: 4.55, NX(S/T): 5 

MS YNGLHQRVFKELKLLTLCS I SSQ I GP PEVALTTDEKS I S WLTAPEKWKRNPEDLPVSMQ 
QIYSNLKY1WSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQ 
CARTLKDQSSEFKAKI I FWYVLP I S I TVFLFS VMGYS I YRY I HVGKEKHPANL IL I YGNEFD 
KRFFVPAEKIVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKH 
LGYASHLMEIFCDSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSL 
QEEVSTQGTLLESQAALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWD 
PQTGRLCIPSLSSFDQDSEGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEW 
GLYVQMEN 

Important features : 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FTQtJRE 63 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTCTCTGCGGGGAGACGCCAGCCTGCG 
TCTGCCATGGGGCTCGGGTTGAGGGGCTGGGGACGTCCTCTGCTGACTGTGGCCACCGCCCT 
GATGCTGCCCGTGAAGCCCCCCGCAGGCTCCTGGGGGGCCCAGATCATCGGGGGCCACGAGG 
TGACCCCCCACTCCAGGCCCTACATGGCATCCGTGCGCTTCGGGGGCCAACATCACTGCGGA 
GGCTTCCTGCTGCGAGCCCGCTGGGTGGTCTCGGCCGCCCACTGCTTCAGCCACAGAGACCT 
CCGCACTGGCCTGGTGGTGCTGGGCGCCCACGTCCTGAGTACTGCGGAGCCCACCCAGCAGG 
TGTTTGGCATCGATGCTCTCACCACGCACCCCGACTACCACCCCATGACCCACGCCAACGAC 
ATCTGCCTGCTGCGGCTGAACGGCTCTGCTGTCCTGGGCCCTGCAGTGGGGCTGCTGAGGCT 
GCCAGGGAGAAGGGCCAGGCCCCCCACAGCGGGGACACGGTGCCGGGTGGCTGGCTGGGGCT 
TCGTGTCTGACTTTGAGGAGCTGCCGCCTGGACTGATGGAGGCCAAGGTCCGAGTGCTGGAC 
CCGGACGTCTGCAACAGCTCCTGGAAGGGCCACCTGACACTTACCATGCTCTGCACCCGCAG 
TGGGGACAGCCACAGACGGGGCTTCTGCTCGGCCGACTCCGGAGGGCCCCTGGTGTGCAGGA 
ACCGGGCTCACGGCCTCGTTTCCTTCTCGGGCCTCTGGTGCGGCGACCCCAAGACCCCCGAC 
GTGTACACGCAGGTGTCCGCCTTTGTGGCCTGGATCTGGGACGTGGTTCGGCGGAGCAGTCC 
CCAGCCCGGCCCCCTGCCTGGGACCACCAGGCCCCCAGGAGAAGCCGCCTGAGCCACAACCT 
TGCGGCATGCAAATGAGATGGCCGCTCCAGGCCTGGAATGTTCCGTGGCTGGGCCCCACGGG 
AAGCCTGATGTTCAGGGTTGGGGTGGGACGGGCAGCGGTGGGGCACACCCATTCCACATGCA 
AAGGGCAGAA.GCAAACCCAGTAAAATGTTAACTGACAAAAAAAAAAAAAAAAAAAAGAAA 



# 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA62845 
xsubunit 1 of 1, 283 aa f 1 stop 
><MW: 30350, pi: 9.66, NX(S/T): 2 

MGLGLRGWGRPLLTVATALMLPVKPPAGSWGAQIIGGHEVTPHSRPYMASVRFGGQHHCGGF 
LLRARWWSAAHCFSHRDLRTGLWLGAHVLSTAEPTQQVFGIDALTTHPDYHPMTHANDIC 
LLRLNGSAVLGPAVGLLRLPGRRARPPTAGTRCRVAGWGFVSDFEELPPGLMEAKVRVLDPD 
VCNSSWKGHLTLTMLCTRSGDSHRRGFCSADSGGPLVCRNRAHGLVSFSGLWCGDPKTPDVY 
TQVSAFVAW I WDWRRS S PQ PGPLPGTTRP PGE AA 

Signal peptide: 

amino acids 1-30 
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GAGCTACCCAGGCGGCTGGTGTGCAGCAAGCTCCGCGCCGACTCCGGACGCCTGACGCCTGA 
CGCCTGTCCCCGGCCCGGCATGAGCCGCTACCTGCTGCCGCTGTCGGCGCTGGGCACGGTAG 
CAGGCGCCGCCGTGCTGCTCAAGGACTATGTCACCGGTGGGGCTTGCCCCAGCAAGGCCACC 
ATCCCTGGGAAGACGGTCATCGTGACGGGCGCCAACACAGGCATCGGGAAGCAGACCGCCTT 
GGAACTGGC C AGGAGAGGAGGC AACATC AT CCTGGC CT G C CGAGACATGG AGAAGTGTGAGG 
CGGCAGCAAAGGACATCCGCGGGGAGACCCTCAATCACCATGTCAACGCCCGGCACCTGGAC 
TTGGCTTCCCTCAAGTCTATCCGAGAGTTTGCAGCAAAGATCATTGAAGAGGAGGAGCGAGT 
GGACATTCTAATCAACAACGCGGGTGTGATGCGGTGCCCCCACTGGACCACCGAGGACGGCT 
TCGAGATGCAGTTTGGCGTTAACCACCTGGGTCACTTTCTCTTGACAAACTTGCTGCTGGAC 
AAGCTGAAAGCCTCAGCCCCTTCGCGGATCATCAACCTCTCGTC CCTGGC CCATGTTGCTGG 
GCAC ATAGACTTTGACGACT TGAACTGG CAGACG AGG AAGTATAACACCAAAGCCG CC TACT 
GCCAGAGCAAGCTCGCCATCGTCCTCTTCACCAAGGAGCTGAGCCGGCGGCTGCAAGGCTCT 
GGTGTGAC TGTC AACG CC CTGCACCC CGG CGTGG C CAGGACAGAGCTGGG CAGACACACGGG 
CATCCATGGCTCCACCTTCTCCAGCACCACACTCGGGCCCATCTTCTGGCTGCTGGTCAAGA 
GCCCCGAGCTGGCCGCCCAGCCCAGCACATACCTGGCCGTGGCGGAGGAACTGGCGGATGTT 
TC CGGAAAGTAC TT CG ATGGACT CAAACAGAAGGCCCCGG CC CC CG AGG CTGAGGATG AGG A 
GGTGGCCCGGAGGCTTTGGGCTGAAAGTGCCCGCCTGGTGGGCTTAGAGGCTCCCTCTGTGA 
GGGAGCAG CC CCTC CC C AGATAACCT CTGGAGC AGATTTGAAAGCCAGGATGG CGC CTCC AG 
ACCGAGGACAGCTGTCCGCCATGCCCGCAGCTTCCTGGCACTACCTGAGCCGGGAGACCCAG 
GACTGGCGGCCGCCATGCCCGCAGTAGGTTCTAGGGGGCGGTGCTGGCCGCAGTGGACTGGC 
CTGCAGGTGAGCACTGCCCCGGGCTCTGGCTGGTTCCGTCTGCTCTGCTGCCAGCAGGGGAG 
AGGGGCCATCTGATGCTTCCCCTGGGAATCTAAACTGGGAATGGCCGAGGAGGAAGGGGCTC 
TGTGCACTTGCAGGCCACGTCAGGAGAGCCAGCGGTGCCTGTCGGGGAGGGTTCCAAGGTGC 
TCCGTGAAGAGCATGGGCAAGTTGTCTGACACTTGGTGGATTCTTGGGTCCCTGTGGGACCT 
TGTGCATGCATGGTCCTCTCTGAGCCTTGGTTTCTTCAGCAGTGAGATGCTCAGAATAACTG 
CTGTCTCCCATGATGGTGTGGTACAGCGAGCTGTTGTCTGGCTATGGCATGGCTGTGCCGGG 
GGTGTTTGCTGAGGGCTTCCTGTGCCAGAGCCCAGCCAGAGAGCAGGTGCAGGTGTCATCCC 
GAGTTCAGGCTCTGCACGGCATGGAGTGGGAACCCCACCAGCTGCTGCTACAGGACCTGGGA 
TTGCCTGGGACTCCCACCTTCCTATCAATTCTCATGGTAGTCCAAACTGCAGACTCTCAAAC 
TTGCTCATTT 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA64842 
xsubunit 1 of 1, 331 aa, 1 stop 
><MW: 35932, pi: 8.45, NX(S/T): 1 

MSRYLLPLSALGTVAGAAVLLKDYVTGGAC PS KAT I PGKTVI VTGANTG I GKQTALELARRG 
GNI I LACRDME KCEAAAKD I RGETLNHHVNARHLDLASLKS I REFAAK I IEEEERVDILINN 
AGVMRCPHWTTEDGFEMQFGVNHLGHFLLTNLLLDKLKASAPSRIINLSSLAHVAGHIDFDD 
LNWQTRKYNTKAAYCQSKLAIVLFTKELS 

S S TTLGP I FWLLVKS PELAAQ P S T YIxAVAEELADVSGKYFDGLKQKAP APEAEDEE VARRLW 
AESARLVGLEAPSVREQPLPR 

Signal peptide: 

amino acids 1-17 



£.6/1-7 o 
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GAAGTTCGCGAGCGCTGGCATGTGGTCCTGGGGCGCGGCTGGCGGCGCTGCTGGCGGTGCTG 

GCGCTCGGGACAGGAGACCCAGAAAGGGCTGCGGCTCGGGGCGACACGTTCTCGGCGCTGAC 

CAGCGTGGCGCGCGCCCTGGCGCCCGAGCGCCGGCTGCTGGGGCTGCTGAGGCGGTACCTGC 

GCGGGGAGGAGGCGCGGCTGCGGGACCTGACTAGATTCTACGACAAGGTACTTTCTTTGCAT 

GAGGATTCAACAACCCCTGTGGCTAACCCTCTGCTTGCATTTACTCTCATCAAACGCCTGCA 

GTCTGACTGGAGG AAT GTGGTACATAGT CTGG AG GC CAGT GAGAAC AT C CGAG CT CTGAAGG 

ATGGCTATGAGAAGGT GG AG CAAGACCTTC CAGCCTTTGAGGAC CTTGAGGGAGCAGCAAGG 

GCCCTGATGCGGCTGCAGGACGTGTACATGCTCAATGTGAAAGGCCTGGCCCGAGGTGTCTT 

TCAGAGAGTCACTGGCTCTGCCATCACTGACCTGTACAGCCCCAAACGGCTCTTTTCTCTCA 

CAGGGGATGACTGCTTCCAAGTTGGCAAGGTGGCCTATGACATGGGGGATTATTACCATGCC 

ATTCCATGGCTGGAGGAGGCTGTCAGTCTCTTCCGAGGATCTTACGGAGAGTGGAAGACAGA 

GGATGAGGCAAGTCTAGAAGATGCCTTGGATCACTTGGCCTTTGCTTATTTCCGGGCAGGAA 

ATGTTTCGTGTGCCCTCAGCCTCTCTCGGGAGTTTCTTCTCTACAGCCCAGATAATAAGAGG 

ATGGCCAGGAATGTCTTGAAATATGAAAGGCTCTTGGCAGAGAGCCCCAACCACGTGGTAGC 

TGAGGCTGTCAT CC AGAGGC CCAATATACC CCACCTGCAGACCAGAGACACCTACGAGGGGC 

TATGTCAGACCCTGGGTTCCCAGCCCACTCTCTACCAGATCCCTAGCCTCTACTGTTCCTAT 

GAGACCAATTCCAACGCCTACCTGCTGCTCCAGCCCATCCGGAAGGAGGTCATCCACCTGGA 

GC C CT ACATTGCT CT CTACCATGACTT CGT CAGTGACT CAGAGG CTCAGAAAATTAGAGAAC 

TTGCAG AACCATGG CT ACAGAGGTCAGTGGTGG C AT CAGGGG AG AAGCAGTT AC AAGTGGAG 

TACCGCATCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTGGTGACCCTCAA 

CCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCAGAGTATCTGCAGG 

TGGTGAACTATGGC AT CGGAGGAC ACTATG AG CCTCAC TTTGAC CATGCTACGTCACCAAGC 

AGC CCC CT CTACAGAATGAAGT CAGGAAAC CG AGTTGCAACATTTATGAT CTATCTGAGCTC 

GGTGGAAGCTGGAGGAGCCACAGCCTTCATCTATGCCAACCTCAGCGTGCCTGTGGTTAGGA 

ATGCAG CACTGTTTTGGTGG AAC CTGCACAGG AGTGGTGAAGGGGAGAGTGACAGACTTC^ 

GCTGGCTGTCCTGTCCTGGTGGGAGATAAGTGGGTGGCCAACAAGTGGATACATGAGTATGG 

ACAGGAATTC CGCAGACC CTGCAGCTCCAGCCCTGAAGACTGAACTGTTGGCAGAGAGAAGC 

TGGTGG AGT CCTGTGGCTTT C C AG AGAAGC CAGGAG C CAAAAGCTGGGGTAGGAGAGGAG AA 

AGCAGAGCAGCCTCCTGGAAGAAGGCCTTGTCAGCTTTGTCTGTGCCTCGCAAATCAGAGGC 

AAGGGAGAGGTTGTTAC CAGGGGACACTGAGAATGTACATTTGATCTGCCC CAGC CACGGAA 

GTC AGAGT AGGATG C ACAGT ACAAAGGAGGGGGGAGTGGAGGCCTGAGAGGG AAGTTT CTGG 

AGTTCAGATACTCTCTGTTGGGAACAGGACATCTCAACAGTCTCAGGTTCGATCAGTGGGTC 

TTTTGG CACTTTGAAC CTTGAC CACAGGGAC CAAGAAGTGGC AATG AGGACAC CTG CAGGAG 

GGGCTAGCCTGACTCCCAGAACTTTAAGACTTTCTCCCCACTGCCTTCTGCTGCAGCCCAAG 

CAGGG AGTGTC CCCCT CCCAGAAGCATATCC CAGATGAGTGGTACATT AT AT AAGGATTTTT 

TTTAAGTTGAAAACAACTTTCTTTTCTTTTTGTATGATGGTTTTTTAAC^CAGTCATTAAAA 

ATGTTTATAAATCAAAA 
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MGPGARIAALLAVLALGTGDPERAAARGDTFSALTSVARALAPERRLLGLLRRYLRGEEARL 
RDLTRFYDKVLSLHEDSTTPVANPLLAFTLIKRLQSDWRNWHSLEASENIRALKDGYEKVE 
QDLPAFEDLEGAARALMRLQDVYMLNVKGLARGVFQRVTGSAITDLYSPKRLFSLTGDDCFQ 
VGKVAYDMGDYYHAIPWLEEAVSLFRGSYGEWKTEDEASLEDALDHLAFAYFRAGNVSCALS 
LSREFLLYSPDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNIPHLQTRDTYEGLCQTLGS 
QPTLYQ I PSLYCSYETNSNAYLLLQP IRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQ 
RSWASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIG 
GHYE PHFDHATS P S S PLYRMKSGNRVATFM I YLSSVEAGGATAF I YANLS VP WRNAALFWW 
NLHRSGEGDSDTLHAGCPVLVGDKWVANKWI HE YGQEFRRP CS S S PED 

Signal peptide: 

amino acids 1-19 
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GAGATAGGGAGTCTGGGTTTAAGTTCCTGCTCCATCTCAGGAGCCCCTGCTCCCACCCCTAG 
GAAGCCACCAGACTCCACGGTGTGGGGCCAATCAGGTGGAATCGGCCCTGGCAGGTGGGGCC 
ACGAGCG CTGGCTGAGGG AC CG AGCCGGAGAG C C CCGGAG CC C C CGT AAC C CG CGCGGGGAG 
CGCCCAGGATGCCGCGCGGGGACTCGGAGCAGGTGCGCTACTGCGCGCGCTTCTCCTACCTC 
TGGCTCAAGTTTTCACTTATCATCTATTCCACCGTGTTCTGGCTGATTGGGGCCCTGGTCCT 
GTCTGTGGGCATCTATGCAGAGGTTGAGCGGCAGAAATATAAAACCCTTGAAAGTGCCTTCC 
TGGCTCCAGCCATCATCCTCATCCTCCTGGGCGTCGTCATGTTCATGGTCTCCTTCATTGGT 
GTGCTGGCGTCCCTCCGTGACAACCTGTACCTTCTCCAAGCATTCATGTACATCCTTGGGAT 
CTGCCTCATCATGGAGCTCATTGGTGGCGTGGTGGCCTTGACCTTCCGGAACCAGACCATTG 
ACTT CCTGAACG ACAACATT CGAAGAGGAATTGAGAACTACT ATGATG AT CTGGACTTCAAA 
AACATCATGGACTTTGTTCAGAAAAAGTTCAAGTGCTGTGGCGGGGAGGACTACCGAGATTG 
GAGCAAGAATCAGTACCAGGACTGCAGTGCCCCTGGACCCCTGGCCTGTGGGGTGCCCTACA 
CCTGCTGCATCAGGAACACGACAGAAGTTGTCAACACCATGTGTGGCTACAAAACTATCGAC 
AAGGAGCGTTTCAGTGTGCAGGATGTCATCTACGTGCGGGGCTGCACCAACGCCGTGATCAT 
CTGGTTCATGGACAACTACACCATCATGGCGTGCATCCTCCTGGGCATCCTGCTTCCCCAGT 
TCCTGGGGGTGCTGCTGACGCTGCTGTACATCACCCGGGTGGAGGACATCATCATGGAGCAC 
TCTGTCACTGATGGGCTCCTGGGGCCCGGTGCCAAGCCCAGCGTGGAGGCGGCAGGCACGGG 
ATGCTGCTTGTGCTACCCCAATTMGGCCCAGCCTGCC^TGGCAGCTCCAAC^GGACCGTC 
TGGGATAGCACCTCTCAGTCAACATCGTGGGGCTGGACAGGGCTGCGGCCCCTCTGCCCACA 
CTCAGTACTGACCAAAGCCAGGGCTGTGTGTGCCTGTGTGTAGGTCCCACGGCCTCTGCCTC 
CCCAGGGAGCAGAGCCTGGGCCTCCCCTAAGAGGCTTTCCCCGAGGCAGCTCTGGAATCTGT 
GCCCACCTGGGGCCTGGGGAACAAGGCCCTCCTTTCTCCAGGCCTGGGCTACAGGGGAGGGA 
GAGCCTGAGGCTCTGCTCAGGGCCCATTTCATCTCTGGCAGTGCCTTGGCGGTGGTATTCAA 
GGCAGTTTTGTAGCACCTGTAATTGGGGAGAGGGAGTGTGCCCCTCGGGGCAGGAGGGAAGG 
GCATCTGGGGAAGGGCAGGAGGGAAGAGCTGTCCATGCAGCCACGCCCATGGCCAGGTTGGC 
CTCTTCTCAGCCTCCCAGGTGCCTTGAGCCCTCTTGCAAGGGCGGCTGCTTCCTTGAGCCTA 
GTTTTTTTTTACGTGATTTTTGTAACATTCATTTTTTTGTACAGATAACAGGAGTTTCTGAC 
TAATCAAAGCTGGTATTTCCCCGCATGTCTTATTCTTGCCCTTCCCCCAACCAGTTTGTTAA 
TCAAACAATAAAAACATGTTTTGTTTTGTTTTTAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA64863 
xsubunit 1 of 1, 294 aa, 1 stop 
><MW: 33211, pi: 5.35, NX(S/T): 3 

MPRGDSEQVR YCARFS YLWLKF SL 1 1 YS TVFWL I GALVLS VG I YAE VERQKYKTLES AFLAP 
A 1 1 L I LLG WMFMVS F I GVLASLRDNL YLLQAFMYI LG I CL I MEL I GGWALTFRNQT I DFL 
NDNIRRGIENYYDDLDFKNIMDFVQKKFKCCGGEDYRDWSKNQYHDCSAPGPLACGVPYTCC 
I RNTTEWNTMCGYKT IDKERFSVQDVI YVRGCTNAVI I WFMDNYT IMACI LLGI LLPQFLG 
VLLTLLYITRVEDIIMEHSVTDGLLGPGAKPSVEAAGTGCCLCYPN 

Signal peptide : 

amino acids 1-44 

Transmembrane domains : 

amino acids 22-42, 57-85, 93-116, 230-257 
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GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGA 
CACCTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACC 
TTGATCCAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGA 
AAAGCTGACACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGC 
TCAGTGCCATGCGGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACC 
GTCCTGAAGCACATCATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAA 
GCCCTCGGCCAATGACCAGGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCA 
ACACGCCCCTGGTCAAGACCATCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATC 
CGCATGGACACCAGTGCAAGTGGCCCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCA 
TGGGAGCCTGCGCATCCAACTGCTGTATAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGC 
AGGTCATGAACCTCCTAGTGCCATCCCTGCCCAATCTAGTGAAAAACCAGCTGTGTCCCGTG 
ATCGAGGCTTCCTTCAATGGCATGTATGCAGACCTCCTGCAGCTGGTGAAGGTGCCCATTTC 
CCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTATCCTGC CATC AAGGGTGACACC ATT C 
AGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAAAGGTGACCAAGTGGTTCAATAAC 
TCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCGTTCAGCCTCATCGTGAGTCA 
GGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGAATTCATGGTCCTGTTGG 
ACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGCTGATCAATGAAAAG 
GCTG CAGATAAG CT GGGAT CTACC CAGATCGTGAAGAT CCTAACTCAGGACACT CCCGAGTT 
TTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTTTCCCTCCA 
GTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGTTTTAC 
ACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTGAT 
GAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCC 
ACTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTG 
AAGGCCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCC 
AGCCTCCTTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAG 
GGAAGGCTGGGTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAAT 
CAATAAACACTTGCCTGTGAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA64881 
xsubunit 1 of 1, 484 aa, 1 stop 
><MW: 52468, pi: 7.14, NX(S/T): 3 

MAGPWTFTLLCGLLAATL IQATLS PTAVLI LGPKVI KEKLTQELKDHNATS ILQQLPLLSAM 
REKPAGGI P VLGSLVNTVLKH I IWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPL 
VKTIVEFHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVI^ 
LLVPSLPNLVKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYL 
GAKLLDSQGKVTKWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEF1WLLDSVL 
PESAHRLKSSIGLINEKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEAb 
RPLFTLGIEASSEAQFYTKGDQLILNrJSTNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSIL 
LPNQNGKLRSGVPVSLVKALGFEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites* 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 



XL/*-?* 



• 
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GAGCGAACATGGCAGCGCGTTGGCGGTTTTGGTGTGTCTCTGTGACCATGGTGGTGGCGCTG 
CTCATCGTTTGCGACGTTCCCTCAGCCTCTGCCCAAAGAAAGAAGGAGATGGTGTTATCTGA 
AAAGGTTAGTCAGCTGATGGAATGGACTAACAAAAGACCTGTAATAAGAATGAATGGAGACA 
AGTTCCGTCGCCTTGTGAAAGCCCCACCGAGAAATTACTCCGTTATCGTCATGTTCACTGCT 
CTCCAACTGCATAGACAGTGTGTCGTTTGCAAGCAAGCTGATGAAGAATTCCAGATCCTGGC 
AAACTC CTGGCGATAC TC CAGTGC ATT CAC CAA C AGGATATTTT TTG C CATGGTGGATTTTG 
ATGAAGGCTCTGATGTATTTCAGATGCTAAACATGAATTCAGCTCCAACTTTCATCAACTTT 
CCTGCAAAAGGGAAACCCAAACGGGGTGATACATATGAGTTACAGGTGCGGGGTTTTTCAGC 
TGAG CAGATTGC C CGGTGGATCGC CG AC AG AACTGATGTC AATATTAGAGTGATTAGAC CCC 
CAAATTATGCTGGTCCCCTTATGTTGGGATTGCTTTTGGCTGTTATTGGTGGACTTGTGTAT 
CTTCGAAGAAGTAATATGGAATTTCTCTTTAATAAAACTGGATGGGCTTTTGCAGCTTTGTG 
TTTTGTG CTTGC T ATGAC AT CTGGTC AAATGTGGAAC CATATAAGAGGACC ACC AT ATG CCC 
ATAAGAAT C C CC ACACGGGACATGTG AATTATAT C CATGG AAGCAGT CAAGC CC AGTTTGT A 
GCTGAAACACACATTGTTCTTCTGTTTAATGGTGGAGTTACCTTAGGAATGGTGCTTTTATG 
TGAAGCTGCTACCTCTGACATGGATATTGGAAAGCGAAAGATAATGTGTGTGGCTGGTATTG 
GACTTGTTGTATTATTCTTCAGTTGGATGCTCTCTATTTTTAGATCTAAATATCATGGCTAC 
CCATACAGCTTTCTGATGAGTTAAAAAGGTCCCAGAGATATATAGACACTGGAGTACTGGAA 
ATTGAAAAACGAAAATCGTGTGTGTTTGAAAAGAAGAATGCAACTTGTATATTTTGTATTAC 
CTCTTTTTTTCAAGTGATTTAAATAGTTAATCATTTAACCAAAGAAGATGTGTAGTGCCTTA 
ACAAGCAATCCTCTGTCAAAATCTGAGGTATTTGAAAATAATTATCCTCTTAACCTTCTCTT 
CCCAGTGAACTTTATGGAACATTTAATTTAGTACAATTAAGTATATTATAAAAATTGTAAAA 
CTACTACTTTGTTTTAGTTAGAACAAAGCTCAAAACTACTTTAGTTAACTTGGTCATCTGAT 
TTTAT ATTGCCTTATC CAAAGATGGGGAAAGTAAGT CCTGAC C AGGTGTT C C CACATATGCC 
TGTTACAGATAACT AC ATTAGG AATTCATT CTTAGCTTCTTC AT CTTTGTGTGGATGTGTAT 
ACTTTACGCATCTTTCCTTTTGAGTAGAGAAATTATGTGTGTCATGTGGTCTTCTGAAAATG 
GAACACCATTCTTCAGAGCACACGTCTAGCCCTCAGCAAGACAGTTGTTTCTCCTCCTCCTT 
GCATATTTCCTACTGCGCTCCAGCCTGAGTGATAGAGTGAGACTCTGTCTCAAAAAAAAGTA 
T CT CTAAATACAGG ATT ATAATTT CTG CTTGAGT ATGGTGTT AACTACCTTGTATTTAGAAA 
GATTTCAGATTCATTCCATCTCCTTAGTTTTCTTTTAAGGTGACCCATCTGTGATAAAAATA 
TAGC TT AGTG CT AAAATC AGTGT AAC TT ATACATGGC CTAAAATGTTTCTACAAATTAGAGT 
TTGT CACTTATT C CATTTGTAC CTAAGAGAAAAATAGGCT CAGTTAGAAAAGGACTCCCTGG 
CCAGGCGCAGTGACTTACGCCTGTAATCTCAGCACTTTGGGAGGCCAAGGCAGGCAGATCAC 
GAGGTC AGGAGT T CGAGACCATC CTGGC CAAC ATGGTGAAAC CCCGTCT CTACT AAAAAT AT 
AAAAATTAGCTGGGTGTGGTGGCAGGAGCCTGTAATCCCAGCTACACAGGAGGCTGAGGCAC 
GAGAAT CACTTGAACTCAGGAGATGGAGGTTT CAGTGAGC CG AG AT CACG CC ACTG CACT C C 
AGCCTGGCAACAGAGCGAGACTCCATCTCAAAAAAAAAAAAAA 
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MAARWRFWC^S VT1NTWALL ^ 

RLVKAPPRNYSVIVMFTALQLHRQC\A/CKQADEEFQILANSWRYSSAFTNRIFFAMVDFDEG 
S DVFQMLNMNSAPTF I NFPAKGKPKRGDTYELQVRGFS AEQ I ARWI ADRTDVNI RVI RPPNY 
AGPLMLGLLLAV I GGLVYLRRSNME F LFNKTGWAFAALCF VL AMTS GQMWNH I RGPP YAHKN 
PHTGHVNYIHGSSQAQFVAETHIVLLFNGGWLGMVLLCEAATSDMDIGKRKIMCVAGIGLV 
VLFFSWMLS I FRS KYHGYPYS FLMS 

Signal peptide: 

amino acids 1-29 

Transmembrane domains : 

amino acids 183-205, 217-237, 217-287, 301-321 
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aagcaaccaaactgcaagctttgggagttgttcgctgtccctgccctgctctgctagggaga 
gaacgccagagggaggcggctggcccggcggcaggctctcagaaccgctaccggcgatgcta 
ctgctgtgggtgtcggtggtcgcagccttggcgctggcggtactggcccccggagcagggga 
gcagaggcggagagcagccaaagcgcccaatgtggtgctggtcgtgagcgactccttcgatg 
gaaggttaacatttcatccaggaagtcaggtagtgaaacttccttttatcaactttatgaag 
acacgtgggacttcctttctgaatgcctacacaaactctccaatttgttgcccatcacgcgc 
agcaatgtggagtggcct cttcactcactt aacagaat cttggaataattttaagggtctag 
atccaaattatacaacatggatggatgtcatggagaggcatggctaccgaacacagaaattt 
gggaaactggactatactt c aggacat c ac tc catt agtaat cgtgtggaagcgtggacaag 
agatgttg cttt cttagt cagacaag aagg cagg cc c atggttaat cttatc cgtaac agga 
ct aaagtc ag agtgatggaaagggattgg c ag aat ac agacaaag cagtaaactggtt aaga 
aaggaagcaattaattacactgaaccatttgttatttacttgggattaaatttaccacaccc 
ttacccttcaccatcttctggagaaaattttggatcttcaacatttcacacatctctttatt 
ggcttgaaaaagtgtctcatgatgccatcaaaatcccaaagtggtcacctttgtcagaaatg 
caccctgtagattattactcttcttatacaaaaaactgcactggaagatttacaAaaaaaga 
aattaagaatattagagcattttattatgctatgtgtgctgagacagatgccatgcttggtg 
aaattattttggcccttcatcaattagatcttcttcagaaaactattgtcatatactcctca 
gaccatggagagctggccatggaacatcgacagttttataaaatgagcatgtacgaggctag 
tgcac atgttc cgcttttgatgatgggacc aggaattaaagccggc ct ac aagt at caaatg 
tggtttctcttgtggatatttaccctaccatgcttgatattgctggaattcctctgcctcag 
aac ctgagtggatactctttgttg ccgttat c atcagaaacatttaagaatgaacat aaagt 
caaaaacctgcatccaccctggattctgagtgaattccatggatgtaatgtgaatgcctcca 
cctacatgcttcgaactaaccactggaaatatatagcctattcggatggtgcatcaatattg 
cctcaactctttgatctttcctcggatccagatgaattaacaaatgttgctgtaaaatttcc 
agaaattacttattctttggatcagaagcttcattccattataaactaccctaaagtttctg 
cttctgtc caccag tataat aaagagcagtttat caagtggaaacaaagt ataggacagaat 
tattcaaacgttatagcaaatcttaggtggcaccaagactggcagaaggaaccaaggaagta 
tgaaaatg caattg at cagtgg cttaaaac cc at atgaatc caagagcagtttgaacaaaaa 
gtttaaaaatagtgttctagagatacatataaatatattacaagatcataattatgtatttt 
aaatgaaacagttttaataattaccaagttttggccgggcacagtggctcacacctgtaatc 
cc aggactttgggagg ctgagg aaagcagat cacaaggtcaagagattgagacc at cctgg c 
caacatggtgaaaccctgtctctactaaaaatacaaaaattagctgggcgcggtggtgcaca 

CCTAT AGT CT CAGCTACT C AGAGGCTG AGGCAGG AGGATCGCTTGAAC C CGGGAGG CAGC AG 
TTGCAGTGAG CTGAGATTGCGCCACTGTACTCCAGC CTGGCAACAGAGTGAGACTGTGTCGC 
AAAAAAATAAAAATAAAATAATAATAATTACCAATTTTTCATTATTTTGTAAGAATGTAGTG 
TATTTTAAGAT AAAATGC CAATGATTATAAAATCAC AT ATTTTC AAAAATGGTT ATT ATT TA 
GGCCTTTGTACAATTTCTAACAATTTAGTGGAAGTATCAAAAGGATTGAAGCAAATACTGTA 
ACAGTTATGTT CCTTTAAAT AATAGAGAAT ATAAAATATTGT AATAAT ATGT AT CATAAAAT 
AGTTGTATGTGAGCATTTGATGGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA64885 
<subunit 1 of 1, 536 aa, 1 stop 
<MW: 61450, pi: 9.17, NX(S/T): 7 

MLLLWVSVVAALALAVLAPGAGEQRRRAAKAPIWVLVVSDSFDGRLTFHPGSQV^ 

MKTRGTSFI^AYTNSPICCPSRAAMWSGLFTHLTESWNNFKGLDPNYTTWMDVMERHGYRTQ 

KFGKLDYTSGHHS I SNRVEAWTRDVAFLLRQEGRPMVNLIRNRTKVRVMERDWQNTDKAVNW 

LRKEAINYTEPFVIYLGLNLPHPYPSPSSGENFGSSTFHTSLYWLEKVSHDAIKIPKWSPLS 

EMHPVDYYSSYTKNCTGRFTKKEIKNIRAFYYAMCAETDAMLGEIILALHQLDLLQKTIVIY 

SSDHGELAMEHRQFYKMSMYEASAHVPLLMMGPGIKAGLQVSNWSLVDIYPTMLDIAGIPL 

PQNLSGYSLLPLSSETFKNEHKVKNLHPPWILSEFHGCNVNASTYMLRTNHWKYIAYSDGAS 

ILPQLFDLSSDPDELTNVAVKFPEITYSLDQKLHSIINYPKVSASVHQYNKEQFIKWKQSIG 

QNYSNV I ANLRWHQDWQKE PRKYENAI DQWLiKTHMNPRAV 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites . 

amino acids 108-111, 166-169, 193-196, 262-265, 375-378, 413-416, 
498-501 

Sulf atases proteins : 

amino acids 286-315, 359-369, 78-97 
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GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAG 
AG CTTCAG C CTGAAGACAAGGG AG CAGTCC CTGAAG ACGCTTCT ACTG AGAGGT CTGC CATG 
GCCTCTCTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACT 
GGTTGCCATGCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAG 
CAGTTGGCTTCTCCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAG 
TGTGACATCTATAGCACCCTTCTGGGCCTGCCCGCTGACATCCAGGGTGCCCAGGCCATGAT 
GGTGACATCCAGTGCAATCTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCA 
CAGTCTTCTGCCAGGAATCCCGAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTC 
ATCCTTGGAGGCCTCCTGGGATTCATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGA 
CTTCTACTCACCACTGGTGCCTGACAGCATGAAATTTGAGATTGGAGAGGCTCTTTACTTGG 
GCATTATTTCTTCCCTGTTCTCCCTGATAGCTGGAATCATCCTCTGCTTTTCCTGCTCATCC 
CAGAGAAATCGCTCCAACTACTACGATGCCTACCAAGCCCAACCTCTTGCCACAAGGAGCTC 
TCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGAGTTCAATTCCTACAGCCTGACAGGGT 
ATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTGGGTCTGTGAAAAACAGTGGACAG 
CACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGTGTCAGAAGGTGCTGCTGAGG 
ATAGACTG ACT TTGGC CATTGGATTG AG CAAAGG C AGAAATGGGGG CT AGTGTAAC AG CATG 
CAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCCTCACCTTGCTGCTC 
CCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCAGGACTCAGAGG 
ATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACATCCCACTG 
ACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATTGCTGG 
GGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTCCC 
TCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCC 
AG AC T AATTTGTGCAT GAACTG AAATAAAACCATC CTACGGTAT C CAGGG AACAGAAAGC AG 
GATGCAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FT PURE 78 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA64886 
xsubunit 1 of 1, 230 aa, 1 stop 
><MW: 24549, pi: 8.56, NX(S/T): 1 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGIT 
QCDIYSTLLGLPADIQAAQAI^VTSSAISSLACIISVVGMRCTVFCQESRAKDRVAVAGGVF 
FILGGLLGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCS 
S QRNRSNY YDAYQ AQ P LATRS S PRPGQ P PKVKS E FNS YS LTG YV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains : 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site . 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 79 



GCACTGCTGCTGTCCCATCAGCTGCTCTGAAGCTCCATGGTGCCCAGAATCTTCGCTCCTGC 
TTATGTGTCAGTCTGTCTCCTCCTCTTGTGTCCAAGGGAAGTCATCGCTCCCGCTGGCTCAG 
AACCATGGCTGTGCCAGCCGGCACCCAGGTGTGGAGACAAGATCTACAACCCCTTGGAGCAG 
TGCTGTTACAATGACGCCATCGTGTCCCTGAGCGAGACCCGCCAATGTGGTCCCCCCTGCAC 
CTTCTGGCCCTGCTTTGAGCTCTGCTGTCTTGATTCCTTTGGCCTCACAAACGATTTTGTTG 
TGAAGCTGAAGGTTCAGGGTGTGAATTCCCAGTGCCACTCATCTCCCATCTCCAGTAAATGT 
GAAAGCAGAAGACGTTTTCCCTGAGAAGACATAGAAAGAAAATCAACTTTCACTAAGGCATC 
TCAGAAACATAGGCTAAGGTAATATGTGTACCAGTAGAGAAGCCTGAGGAATTTACAAAATG 
ATGC AG CTC CAAGC CATTGT AT GG C C CATGTGGG AG ACTGATGGG ACATGGAGAATG AC AGT 
AGATTATCAGGAAATAAATAAAGTGGTTTTTCCAATGTACACACCTGTAAAA 



77 
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FIGURE 80 

MVPRIFAPAYVSVCLLLLCPREVIAPAGSEPWLCQPAPRCGDKIYNPLEQCCYNDAIVSLSE 
TRQCGPPCTFWPCFELCCLDSFGLTNDFWKLKVQGVNSQCHSSPISSKCESRRRFP 

Signal peptide: 

amino acids 1-25 
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FIGURE 81 



CTCCACTGCAACCACCCAGAGCCATGGCTCCCCGAGGCTGCATCGTAGCTGTCTTTGCCATT 
TTCTGCATCTCCAGGCTCCTCTGCTCACACGGAGCCCCAGTGGCCCCCATGACTCCTTACCT 
GATGCTGTGCCAGCCACACAAGAGATGTGGGGACAAGTTCTACGACCCCCTGCAGCACTGTT 
GCTATGATGATGCCGTCGTGCCCTTGGCCAGGACCCAGACGTGTGGAAACTGCACCTTCAGA 
GTCTGCTTTGAGCAGTGCTGCCCCTGGACCTTCATGGTGAAGCTGATAAACCAGAACTGCGA 
CTCAGCCCGGACCTCGGATGACAGGCTTTGTCGCAGTGTCAGC TAA T.GGAACATCAGGGGAA 
CGATGACTCCTGGATTCTCCTTCCTGGGTGGGCCTGGAGAAAGAGGCTGGTGTTACCTGAGA 
TCTGGGATGCTGAGTGGCTGTTTGGGGGCCAGAGAAACACACACTCAACTGCCCACTTCATT 
CTGTGACCTGTCTGAGGCCCACCCTGCAGCTGCCCTGAGGAGGCCCACAGGTCCCCTTCTAG 
AATTCTGGACAGCATGAGATGCGTGTGCTGATGGGGGCCCAGGGACTCTGAACCCTCCTGAT 
GACCCCTATGGCCAACATCAACCCGGCACCACCCCAAGGCTGGCTGGGGAACCCTTCACCCT 
TCTGTGAGATTTTCCATCATCTCAAGTTCTCTTCTATCCAGGAGCAAAGCACAGGATCATAA 
TAAATTTATGTACTTTATAAATGAAAA 



SI /X7* 
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FIGURE 82 

MAPRGCIVAVFAIFCISRLLCSHGAPVAPMTPYLMLCQPHKRCGDKFYDPLQHCCYDDAWP 
LARTQTCGNCTFRVCFEQCCPVJTFMVKLINQNCDSARTSDDRLCRSVS 

Signal peptide: 

amino acids 1-24 
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FIGURE 83 



GGGGGCGGGTGCCTGGAGCACGGCGCTGGGGCCGCCCGCAGCGCTCACTCGCTCGCACTCAG 
TCGCGGGAGGCTTCCCCGCGCCGGCCGCGTCCCGCCCGCTCCCCGGCACCAGAAGTTCCTCT 
GCGCGTCCGACGGCGACATGGGCGTCCCCACGGCCCTGGAGGCCGGCAGCTGGCGCTGGGGA 
TCCCTGCTCTTCGCTCTCTTCCTGGCTGCGTCCCTAGGTCCGGTGGCAGCCTTCAAGGTCGC 
CACGCCGTATTCCCTGTATGTCTGTCCCGAGGGGCAGAACGTCACCCTCACCTGCAGGCTCT 
TGGGCCCTGTGGACAAAGGGCACGATGTGACCTTCTACAAGACGTGGTACCGCAGCTCGAGG 
GGCGAGGTGCAGACCTGCTCAGAGCGCCGGCCCATCCGCAACCTCACGTTCCAGGACCTTCA 
CCTGCACCATGGAGGCCACCAGGCTGCCAACACCAGCCACGACCTGGCTCAGCGCCACGGGC 
TGGAGTCGGCCTCCGACCACCATGGCAACTTCTCCATCACCATGCGCAACCTGACCCTGCTG 
GATAGCGGCCTCTACTGCTGCCTGGTGGTGGAGATCAGGCACCACCACTCGGAGCACAGGGT 
CCATGGTGCCATGGAGCTGCAGGTGCAGACAGGCAAAGATGCACCATCCAACTGTGTGGTGT 
ACCCATCCTCCTCCCAGGATAGTGAAAACATCACGGCTGCAGCCCTGGCTACGGGTGCCTGC 
ATCGTAGGAATCCTCTGCCTCCCCCTCATCCTGCTCCTGGTCTACAAGCAAAGGCAGGCAGC 
CTCCAACCGCCGTGCCCAGGAGCTGGTGCGGATGGACAGCAACATTCAAGGGATTGAAAACC 
CCGGCTTTGAAGCCTCACCACCTGCCCAGGGGATACCCGAGGCCAAAGTCAGGCACCCCCTG 
TCCTATGTGGCCCAGCGGCAGCCTTCTGAGTCTGGGCGGCATCTGCTTTCGGAGCCCAGCAC 
CCCCCTGTCTCCTCCAGGCCCCGGAGACGTCTTCTTCCCATCCCTGGACCCTGTCCCTGACT 
CTCCAAACTTTGAGGTCATCTAGCCCAGCTGGGGGACAGTGGGCTGTTGTGGCTGGGTCTGG 
GGCAGGTGCATTTGAGCCAGGGCTGGCTCTGTGAGTGGCCTCCTTGGCCTCGGCCCTGGTTC 
CCTCCCTCCTGCTCTGGGCTCAGATACTGTGACATCCCAGAAGCCCAGCCCCTCAACCCCTC 
TGGATGCTACATGGGGATGCTGGACGGCTCAGCCCCTGTTCCAAGGATTTTGGGGTGCTGAG 
ATTCTCCCCTAGAGACCTGAAATT CACCAGCTACAGATGC CAAATGACTTACATCTTAAGAA 
GTCTCAGAACGTCCAGCCCTTCAGCAGCTCTCGTTCTGAGACATGAGCCTTGGGATGTGGCA 
GCATCAGTGGGACAAGATGGACACTGGGCCACCCTCCCAGGCACCAGACACAGGGCACGGTG 
GAGAGACTTCTCCCCCGTGGCCGCCTTGGCTCCCCCGTTTTGCCCGAGGCTGCTCTTCTGTC 
AGACTTCCTCTTTGTACCACAGTGGCTCTGGGGCCAGGCCTGCCTGCCCACTGGCCATCGCC 
ACCTTCCCCAGCTGCCTCCTACCAGCAGTTTCTCTGAAGATCTGTCAACAGGTTAAGTCAAT 
CTGGGGCTTCCACTGCCTGCATTCCAGTCCCCAGAGCTTGGTGGTCCCGAAACGGGAAGTAC 
ATATTGGGGCATGGTGGCCTCCGTGAGCAAATGGTGTCTTGGGCAATCTGAGGCCAGGACAG 
ATGTTGCCCCACCCACTGGAGATGGTGCTGAGGGAGGTGGGTGGGGCCTTCTGGGAAGGTGA 
GTGGAGAGGGGCACCTGCCCCCCGCCCTCCCCATCCCCTACTCCCACTGCTCAGCGCGGGCC 
ATTGCAAGGGTGCCACACAATGTCTTGTCCACCCTGGGACACTTCTGAGTATGAAGCGGGAT 
GCTATTAAAAACTACATGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA 
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FIGURE 84 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA64 8 97 
xsubunit 1 of 1, 311 aa, 1 stop 
><MW: 33908, pi: 6.87, NX(S/T): 6 

MGVPTALEAGSWRWGSLLFALFLAASLGPVAAFKVATPYSLYVCPEGQNVTLTCRLLGPVDK 
GHDVTFYKTWYRSSRGEVQTCSERRPIRNLTFQDLHLHHGGHQAANTSHDLAQRHGLESASD 
HHGNFS ITMRNLTLLDSGLYCCLWE I RHHHS EHR VHGAMELQ VQTG KD APSNCWYPS S S Q 
DSEN I TAAALATGAC I VG I LCL PL I LLL VYKQRQAASNRRAQE LVRMDSNI QG I ENPGFE AS 
PPAQGIPEAKVRHPLSYVAQRQPSESGRHLLSEPSTPLSPPGPGDVFFPSLDPVPDSPNFEVI 

Signal peptide: 

amino acids 1-2 8 

Transmembrane domain: 

amino acids 190>216 
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FIGURE 85 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCT 
TTCCCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCC 
TTTCCTGCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGG 
GGTCTGTGGGTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGA 
CTCCGCTCCCGGACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTC 
TCCTCCTTGCTGGGACTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCC 
AGACATGTTCTGCCTTTTCCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACT 
TGGAGCCACAAGGCCTGATGTACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGT 
TGTTACCGCCTCCACTGTCCGCCTGTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATG 
CTGTCCCAAGTGTGTGGAACCTCACACTCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCC 
AGCACAACGGGACCATGTACCAACACGGAGAGATCTTCAGTGCCCATGAGCTGTTCCCCTCC 
CGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGCACAGAGGGCCAGATCTACTGCGGCCTCAC 
AACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCCACTGCCAGACTCCTGCTGCCAAGCCT 
GCAAAGATGAGGCAAGTGAGCAAT CGGATG AAGAGG AC AGTGTGCAGT CG CT C CATGGGGTG 
AGACATC CT CAGGAT CCATG TT CC AGTG ATG CTGGG AGAAAGAG AGGC CCGGGC AC CCCAGC 
CCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTTCAGACCCAAGGGAGCAG 
GCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCTGTGTGCATGGCGGG 
AAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGCCCCTTGCCCTG 
CATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCCCACCGAGT 
ACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGGACAAA 
GCAGACCCTGGCCACAGTGAGATC7VGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTCCT 
CGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGG 
CCTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAG 
AGAGGTGAAGTACCTGGC C CAAGG CC AC AC AG CC AGAAT CTT CC ACTTGACT CAGATC AAGA 
AAGTCAGGAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCAC 
GAAGGTCACTGGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGA 
CAAAGTGACCAAG AC ATAAC AAAG AC CTAAC AGTTG C AGATATGAG CTGT ATAATTGTTGTT 
ATTATATATTAATAAATAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 



# 
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FIGURE 86 



></usr/seqdb2/sst /DNA/Dnaseqs . min/ ss . DNA64 902 
xsubunit 1 of 1, 451 aa, 1 stop 
><MW: 49675, pi: 7.15, NX(S/T): 1 

MVPEWVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCT 
CSEGAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIF 
SAHELFPSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEED 
SVQSLHGVRHPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKH 
KKACVHGGKTYSHGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKC 
CKICPEDKADPGHSE I SSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVE I YLWKLV 
KDEETEAQRGEVPGPRPHSQNLPLDSDQESQEARLPERGTALPTARWPPRRSLE^LPSPDPG 
AEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 37 



CTAGCCTGCGCCAAGGGGTAGTGAGACCGCGCGGCAACAGCTTGCGGCTGCGGGGAGCTCCC 

GTGGGCGCTCCGCTGGCTGTGCAGGCGGCCATGGATTCCTTGCGGAAAATGCTGATCTCAGT 

CG CAATG CTGGGCGCAGGGG CTGGCGTGGG CTACGCG CTCCTCGTTATCGTG AC CCCGGG AG 

AGCGGCGGAAGCAGGAAATGCT AAAGGAGATGCC ACTG CAGGAC CCAAGGAGCAGGGAGGAG 

GCGGCCAGGACCCAGCAGCTATTGCTGGCCACTCTGCAGGAGGCAGCGACCACGCAGGAGAA 

CGTGGCCTGGAGGAAGAACTGGATGGTTGGCGGCGAAGGCGGCGCCAGCGGGAGGTCACCGT 

GAGACCGGACTTGCCTCCGTGGGCGCCGGACCTTGGCTTGGGCGCAGGAATCCGAGGCAGCC 

TTTCTCCTTCGTGGGCCCAGCGGAGAGTCCGGACCGAGATACCATGCCAGGACTCTCCGGGG 

TCCTGTGAGCTGCCGTCGGGTGAGCACGTTTCCCCCAAACCCTGGACTGACTGCTTTAAGGT 

CCGCAAGGCGGGCCAGGGCCGAGACGCGAGTCGGATGTGGTGAACTGAAAGAACCAATAAAA 

TC^TGTTCCTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAA 
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FIGURE 88 

MDSLRKMLISVAMLGAGAGVGYALLVIVTPGERRKQEMLKEMPLQDPRSREEAARTQQLLLA 
TLQEAATTQENVAWRKNWMVGGEGGASGRSP 

Signal peptide: 

amino acids 1-18 
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FIGURE 89 



CAGGAGAGAAGGCACCGCCCCCACCCCGCCTCCAAAGCTAACCCTCGGGCTTGAGGGGAAGA 

GG CTGACTGTACGTTCCTTCTACTCTGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACC 

CCTCTCCTCATCTTGTTCCTTTTGTCATGGTCGGGACCCCTCCAAGGACAGCAGCACCACCT 

TGTGGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAACGGCTGGCCCAGTGCCAGGACC 

AGAGTAGTCGGCATGCTGCTGAGCTGCGGGACTTCAAGAACAAGATGCTGCCACTGCTGGAG 

GTGGCAGAGAAGGAGCGGGAGG CACT CAGAAC TG AGG C CG ACAC CAXCTCCGGGAGAGTGGA 

TCGTCTGGAGCGGGAGGTAGACTATCTGGAGACCCAGAACCCAGCTCTGCCCTGTGTAGAGT 

TTGATGAG AAGGTGACTG GAGG C C CTGGGAC C AAAGG CAAGGGAAGAAGGAATG AG AAGT AC 

GATATGGTGACAGACTGTGGCTACACAATCTCTCAAGTGAGATCAATGAAGATTCTGAAGCG 

ATTTGGTGGCCCAGCTGGTCTATGGACCAAGGATCCACTGGGGCAAACAGAGAAGATCTACG 

TGTTAGATGGGACACAGAATGACACAGCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCCTT 

GCCATGGCTGCCCGGAAAGCTTCCCGAGTCCGGGTGCCCTTCCCCTGGGTAGGCACAGGGCA 

GCTGGTATATGGTGGCTTTCTTTATTTTGCTCGGAGGCCTCCTGGAAGACCTGGTGGAGGTG 

GTGAGATGGAGAACACTTTGCAGCTAAT C AAATT CC ACCTGG CAAACCGAAC AGTGGTGGAC 

AGCTC AGTATTCCCAGCAGAGGGG CTGATC C CC CCCT ACGG CTTG ACAG C AGAC AC CTACAT 

CGACCTGGTAGCTGATGAGGAAGGTCTTTGGGCTGTCTATGCCACCCGGGAGGATGACAGGC 

ACTTGTGT CTGG CCAAGTTAGATC CAGAG AC ACTGG AC AC AG AG CAG CAGTGGGACACACCA 

TGTCCCAGAGAGAATGCTGAGGCTGCCTTTGTCATCTGTGGGACCCTCTATGTCGTCTATAA 

CACCCGTCCTGCCAGTCGGGCCCGCATCCAGTGCTCCTTTGATGCCAGCGGCACCCTGACCC 

CTGAACGGGCAGCACTCCCTTATTTTCCCCGCAGATATGGTGCCCATGCCAGCCTCCGCTAT 

AACC CCCGAGAACGCCAGCT CTATGCCTGGGATGATGGCTACCAGATTGT CTATAAGCTGGA 

GATGAGGAAGAAAGAGGAGGAGGTTTGAGGAGCTAGCCTTGTTTTTTGCATCTTTCTCACTC 

CCATACATTTATATTATATCCCCACTAAATTTCTTGTTCCTCATTCTTCAAATGTGGGCCAG 

TTGTGGCTCAAATCCTCTATATTTTTAGCCAATGGCAATCAAATTCTTTCAGCTCCTTTGTT 

TC AT ACGG AACT CCAGAT C CTG AGTAAT CCTTTT AG AGCC CG AAGAGT CAAAAC C CTCAATG 

TTCCCTCCTGCTCTCCTGCCCCATGTCAACAAATTTCAGGCTAAGGATGCCCCAGACCCAGG 

GCTCTAACCTTGTATGCGGGCAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCCCCTCAGAGTG 

ACTTGGGGAGGGAGAAATAGGAGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCT 

TCAGTGTCCTGAGGAACAGGACTTTCTCCACATTGTTTTGTATTGCAACATTTTGCATTAAA 

AGGAAAATCCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 90 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA64905 
<subunit 1 of 1, 406 aa, 1 stop 
<MW: 46038, pi: 6.50, NX(S/T): 2 

MGPSTPLLILFLLSWSGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKNKM 
LPLLEVAEKERE ALRTEADT I SGRVDRLEREVDYLETQNP ALPCVEFDEKVTGGPGTKGKGR 
RNEKYDMVTDCGYTISQVRSMKILKRFGGPAGLWTKDPLGQTEKIYVLDGTQNDTAFVFPRL 
RDFTLAMAARKASRVRVPFPWVGTGQLVYGGFLYFARRP 

RTWDS S VF P AEGL I P P YGLTADT Y I DLVADEEGLWAVYATREDDRHLCL AKLD PQTLDTEQ 
QWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFDASGTLTPERAALPYFPRRYGAH 
ASLRYNPRERQLYAWDDGYQ I VYKLEMRKKEEEV 

Important features : 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 177-180, 248-251 
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FIGURE 91 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAG 
AGCCTCTCCGTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCAT 
CCGTCACCTCTCCTGTCATCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGG 
CTCTCATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTT 
GGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAGGACGCAGCATTCTCCTGTTTCCTGTC 
TCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGGCCAGTTCTCTAGCGTGG 
TC CACCTCTACAGGGACGGG AAGG AC C AGC CATTTATG CAGATGC C ACAGTATCAAGGCAGG 
ACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTAC 
TGTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCA 
TCTGGGAGCTACAGGTGTCAGC ACTGGGCT CAGTTCCT CTCATTTCCATCACGGGATATGTT 
GATAGAGACATCCAGCTACTCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTG 
GAAAGGTC CACAAGGACAGGATTTGT CCAC AGACTCCAGGACAAACAGAGACATGCATGGCC 
TGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAACGCCGGGAGCATATCCTGTTCCATG 
CGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAGGAGATACCTTTTTCGA 
GCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTATTTTTTG 
GCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGAC 
TGGAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGC C CGGAAACACGC AGTGGAGGTGAC 
TCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATA 
GAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCT 
TCT CAGAGTTTC CAAGCAGGGAAACATTACTGGGAGGT GG ACGG AGGACACAATAAAAGGTG 
GCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGAAGGAGTACGTGACTTTGTCTCCCG 
ATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTCACATTAAATCCCCGT 
TTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAGTG 
TGGGAC CATCTCCTTCTT CAACATAAATGACC AGTC CCTTATTTATACCCTGACATGTCGGT 
TTGAAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCC 
ATAGTCATCTGCCCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGC 
AATCCCAGAGACAAGCAACAGTGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGG 
GTGAAATGTAGGATGAATCACATCCCACATTCTTCTTTAGGGATATTAAGGTCTCTCTCCCA 
GATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCAGATGAAGGGGGACTGGCCTGTCC 
ACATGGGAGTCAGGTGTCATGG CTGC CCTGAGCTGGGAGGGAAGAAGGCTGACATTACATTT 
AGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAGAACCG 
TCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGC 
TT AG AT CTTATTGATG AC AGAGTGTATC CTAATGGTTTGTTC ATTATATT ACAC TT TC AGTA 
AAAAAA 
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FIGURE 92 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSS 
WHLYRDGKDQPFMQMPQYQGRTKLVKDS I AEGR I SLRLENI TVLDAGLYGCRI SSQS YYQK 
AI WELQVS ALGS VPL I S I TG YVDRD I QLLCQS SG WF PRPTAKWKGPQGQDLS TD SRTNRDMH 
GLFDVEISLTVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLF 
FGIVGLKIFFSKFQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVT 
HRKAPQEVPHSEKRFTRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS 
PDHGYWVLRLNGEHLYFTLNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTC 
RFEGLLRPYIEYPSYNEQNGTPIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLP 
RGEM 

Signal peptide: 

amino ac ids 1-17 

Transmembrane domain : 

amino acids 239-255 
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FIGURE 93 



GCGATGGTGCGCCCGGTGGCGGTGGCGGCGGCGGTTGCGGAGGCTTCCTTGGTCGGATTGCA 
ACGAGGAGAAGATGACTGACCAACCGACTGGCTGAATGAATGAATGGCGGAGCCGAGCGCGC 
CATGAGGAGCCTGCCGAGCCTGGGCGGCCTCGCCCTGTTGTGCTGCGCCGCCGCCGCCGCCG 
CCGTCGCCTCAGCCGCCTCGGCGGGGAATGTCACCGGTGGCGGCGGGGCCGCGGGGCAGGTG 
GACGCGTCGCCGGGCCCCGGGTTGCGGGGCGAGCCCAGCCACCCCTTCCCTAGGGCGACGGC 
TCCCACGGCCCAGGCCCCGAGGACCGGGCCCCCGCGCGCCACCGTCCACCGACCCCTGGCTG 
CGACTTCTCCAGCCCAGTCCCCGGAGACCACCCCTCTTTGGGCGACTGCTGGACCCTCTTCC 
ACCACCTTTCAGGCGCCGCTCGGCCCCTCGCCGACCACCCCTCCGGCGGCGGAACGCACTTC 
GACCACCTCTCAGGCGCCGACCAGACCCGCGCCGACCACCCTTTCGACGACCACTGGCCCGG 
CGCCGACCACCCCTGTAGCGACCACCGTACCGGCGCCCACGACTCCCCGGACCCCGACCCCC 
GATCTCCCCAGCAGCAGCAACAGCAGCGTCCTCCCCACCCCACCTGCCACCGAGGCCCCCTC 
TTCGCCTCCTCCAGAGTATGTATGTAACTGCTCTGTGGTTGGAAGCCTGAATGTGAATCGCT 
GCAACCAGACCACAGGGCAGTGTGAGTGTCGGCCAGGTTATCAGGGGCTTCACTGTGAAACC 
TGCAAAGAGGGCTTTTACCTAAATTACACTTCTGGGCTCTGTCAGCCATGTGACTGTAGTCC 
ACATGGAGCTCTCAGCATACCGTGCAACAGGTAAGCAACAGAGGGTGGAACTGAAGTTTATT 
TTATTTTAGCAAGGGAAAAAAAAAGGCTGCTACTCTCAAGGACCATACTGGTTTAAACAAAG 
GAGGATGAGGGTCATAGATTTACAAAATATTTTATATACTTTTATTCTCTTACTTTATATGT 
TATATTTAATGTCAGGATTTAAAAACATCTAATTTACTGATTTAGTTCTTCAAAAGCACTAG 
AGTCGCCAATTTTTCTCTGGGATAATTTCTGTAAATTTCATGGGAAAAAATTATTGAAGAAT 
AAATCTGCTTTCTGGAAGGGCTTTCAGGCATGAAACCTGCTAGGAGGTTTAGAAATGTTCTT 
ATGTTTATTAATATACCATTGGAGTTTGAGGAAATTTGTTGTTTGGTTTATTTTTCTCTCTA 
ATCAAAATTCTACATTTGTTTCTTTGGACATCTAAAGCTTAACCTGGGGGTACCCTAATTTA 
TTTAACTAGTGGTAAGTAGACTGGTTTT ACT CTATTTAC C AGTACATTTTTGAG ACCAAAAG 
TAGATTAAGCAGGAATTATCTTTAAACTATTATGTTATTTGGAGGTAATTTAATCTAGTGGA 
ATAATGTACTGTTATCTAAGCATTTGCCTTGTACTGCACTGAAAGTAATTATTCTTTGACCT 
TATGTGAGGCACTTGGCTTTTTGTGGAC CC CAAGTCAAAAAACTGAAGAGACAGTATTAAAT 
AATGAAAAAAATAATGACAGGTTATACTCAGTGTAACCTGGGTATAACCCAAGATCTGCTGC 
CACTTACGAGCTGTGTTCCTTGGGCAAGTAATTTCCTTTCACTGAGCTTGTTTCTTCTCAAG 
GTTGTTGTGAAGATTAAATGAGTTGATATATATAAAATGCCTAGCACATGTCACTCAATAAA 
TTCTGGTTTGTTTTAATTTCAAAGGAATATTATGGACTGAAATGAGAGAACATGTTTTAAGA 
ACTTTTAGCTCCTTGACAAAGAAGTGCTTTATACTTTAGCACTAAATATTTTAAATGCTTTA 
TAAATGATATTATACTGTTATGGAATATTGTATCATATTGTAGTTTATTAAAAATGTAGAAG 
AGGCTGGGCGCGGTGGCTCACGCCTGTAATCCTAGCACTTTGGGAGGCCAAGGCGGGTGGAT 
CACTTG AGG C CAGGAGTTCTAGATGAGC CTGGC C AG CACAGTGAAACC CCGT CT CT ACTAAA 
AATACAAACAAATTAGCTGGGCGTGGTGGCACACACCTGTAGTCCCAGCTACTCGGGAGGCT 
GAGGCAGGAGAATCGGTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCTGAGATCGCGCCACT 
GC ACT CC AGCCTGGTGAG AG AGGGAGACTCTGT CTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 94 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA64952 
xsubunit 1 of 1, 258 aa, 1 stop 
><MW: 25716, pi: 8.13, NX(S/T): 5 

MRSLPSLGGLALLCCAAAAAAVASAASAGNVTGGGGAAGQVDASPGPGLRGEPSHPFPRATA 
PTAQAPRTGP PRATVHRPLAATS PAQS PETTPLWATAGPS STTFQAPLGPS PTTPPAAERTS 
TTSQAPTRPAPTTLSTTTGPAPTTPVATTVPAPTTPRTPTPDLPSSSNSSVLPTPPATEAPS 
SPPPEYVCNCSWGSLNVNRCNQTTGQCECRPGYQGLHCETCKEGFYLNYTSGLCQPCDCSP 
HGALS I PCNR 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 30-33, 172-175, 195-198, 208-211, 235-238 

EGF-like domain cysteine pattern signature. 

amino acids 214-226. 
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FIGURE 95 



TGCGGCGCAGTGTAGACCTGGGAGGATGGGCGGCCTGCTGCTGGCTGCTTTTCTGGCTTTGG 
TCTCGGTGCCCAGGGCCCAGGCCGTGTGGTTGGGAAGACTGGACCCTGAGCAGCTTCTTGGG 
CCCTGGTACGTGCTTGCGGTGGCCTCCCGGGAAAAGGGCTTTGCCATGGAGAAGGACATGAA 
GAACGTCGTGGGGGTGGTGGTGACCCTCACTCCAGAAAACAACCTGCGGACGCTGTCCTCTC 
AGCACGGGCTGGGAGGGTGTGACCAGAGTGTCATGGACCTGATAAAGCGAAACTCCGGATGG 
GTGTTTGAGAATCCCTCAATAGGCGTGCTGGAGCTCTGGGTGCTGGCCACCAACTTCAGAGA 
CTATGCCATCATCTTCACTCAGCTGGAGTTCGGGGACGAGCCCTTCAACACCGTGGAGCTGT 
ACAGTCTGACGGAGACAGCCAGCCAGGAGGCCATGGGGCTCTTCACCAAGTGGAGCAGGAGC 
CTGGGCTTCCTGTCACAGTAGCAGGCCCAGCTGCAGAAGGACCTCACCTGTGCTCACAAGAT 
CCTTCTGTGAGTGCTGCGTCCCCAGTAGGGATGGCGCCCACAGGGTCCTGTGACGTCGGCCA 
GTGTCCACCCACCTCGCTCAGCGGCTCCCGGGGCCCAGCACCAGCTCAGAATAAAGCGATTC 
CACAGCA 
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FIGURE 96 



MGGLLLAAFLALVS VPRAQAVWLGRLD PEQLLGP WYVLAVAS RE KG FAME KDMKNWGVWT 
LTPENNLRTLSSQHGLGGCDQSVMDLIKRNSGWVFENPSIGVLELWVLATNFRDYAIIFTQL 
EFGDEPFNTVELYSLTETASQEAMGLFTKWSRSLGFLSQ 

Signal peptide: 

amino acids 1-20 
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FIGURE 97 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCC 
CCTG CTCTGGGGGAGGG AGAGGG CGG AAGGAC AGAC AAGTAAAC TGCTGACG ATG C AGAGTT 
CCGTGACGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCAT 
GGCTGGATTTACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGA 
CCAGGATGCTCCAGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACC 
GATTC CACCTCCTTGGGG AC CC AC ATAC CAAG AATTGC AC CCTGAG CATCAGAGATGC CAG A 
AGAAGTGATG CGGGG AGATACTT CTT T CGT ATGG AGAAAGGAAGTATAAAATGGAATT ATAA 
ACATCACCGGCTCTCTGTGAATGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAG 
GCACCCTGGAGTCCGGCTGCCCCCAGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAG 
GGGACACCCCCTATGATCTCCTGGATAGGGACCTCCGTGTCCCCCCTGGACCCCTCCACCAC 
CCGCTCCTCGGTGCTCACCCTCATCCCACAGCCCCAGGACCATGGCACCAGCCTCACCTGTC 
AGGTGACCTTCCCTGGGGCCAGCGTGACCACGAACAAGACCGTCCATCTCAACGTGTCCTAC 
CCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGAGACGGCACAGTATCCACAGTCTTGGG 
AAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCTGCGCCTGGTCTGTGCAGTTGATG 
CAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGAGAGGCCTGACCCTGTGCCCC 
TCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCACCTGAGGGATGCAGCTGA 
ATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCTGAACGTCTCCCTGC 
AGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAGCCACAGCCCTG 
GTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAATCGGCAAG 
GC CAG CAG CGGG CGTGGG AG ATACGGGCAT AGAGGATGC AAACGCTGTCAGGGGTT CAGC CT 
CTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAGCT 
TCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGT 
GAAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCC 
AC AG ATGAG AAACTG CAGAGACT C ACC CTGATTG AGGG AT CACAG C CCCTCC AGGC AAGGGA 
GAAGTCAGAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACT 
ATGAATTATGTG CAGAGTGAAAAGCACACAGG CTTT AGAGTC AAAGTATCTC AAAC CTGAAT 
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FIGURE 98 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWF 
REGANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS I RDARRSDAGRYFFRMEKG 
SIKWNYKHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVS 
PLDPSTTRSSVLTL I PQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDG 
TVSTVLGNGSSLSLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWV 
HLRDAAEFTCRAQNPLGSQQVYLNVSLQSKATSGV^ 

CRKKSARPAAGVGDTGIEDANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYA 
S LS FQMVKP WDS RGQE ATDTE YS E I K I HR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 3 51-370 
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FIGURE 99 



GACGCCCAGTGACCTGCCGAGGTCGGCAGCACAGAGCTCTGGAGATGAAGACCCTGTTCCTG 
GGTGTCACGCTCGGCCTGGCCGCTGCCCTGTCCTTCACCCTGGAGGAGGAGGATATCACAGG 
GACCTGGTACGTGAAGGCCATGGTGGTCGATAAGGACTTTCCGGAGGACAGGAGGCCCAGGA 
AGGTGTCCCCAGTGAAGGTGACAGCCCTGGGCGGTGGGAAGTTGGAAGCCACGTTCACCTTC 
ATGAGGGAGGATCGGTGCAT C C AG AAGAAAAT CCTG ATG CGG AAGACGGAGG AG CCTGGCAA 
ATACAGCGCCTATGGGGGCAGGAAGCTC ATGTAC CTGCAGGAGCTG CC CAGGAGGGACCACT 

acatcttttactgcaaagac cagcac catgggggcctgct ccacatgggaaagcttgtgggt 
aggaat tctg at ac caac cggg aggc c ctg gaag aatttaagaaattggtgcag cgcaaggg 
actctcggaggaggacattttcacgcccctgcagacgggaagctgcgttcccgaacactagg 
cagcccccgggtctgcacctccagagccct^ccctaccaccagacacagagcccggaccacct 
ggacctaccctccagccatgacccttccctgctcccacccacctgactccaaataaagtcct 
tttccccc^aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa^ 
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FIGURE 100 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA654 04 
<subunit 1 of 1, 170 aa, 1 stop 



<MW: 19457, pi: 9.10, NX(S/T): 0 

MKTLFLGVTLGLAAALSFTLEEEDITGTWYVKAMVVDKDFPEDRRPRKVSPVKVTALGGGKL 
EATFTFMREDRCIQKKILMRKTEEPGKYSAYGGRKLMYLQELPRRDHYIFYCKDQHHGGLLH 
MGKLVGRNSDTNREALEEFKKLVQRKGLSEEDIFTPLQTGSCVPEH 

Important features : 
Signal peptide: 

amino acids 1-17 
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FIGURE 101 



GTT C CG CAGATGCAGAGGTTG AGGTGG CTG CGGG AC TGGAAGTC AT CG GG CAGAGG T CTC AC 
AGCAGCCAAGGAACCTGGGGCCCGCTCCTCCCCCCTCCAGGCCATGAGGATTCTGCAGTTAA 
TCCTGCTTGCTCTGGCAACAGGGCTTGTAGGGGGAGAGACCAGGATCATCAAGGGGTTCGAG 
TGCAAGCCTCACTCCGAGCCCTGGCAGGCAGCCCTGTTCGAGAAGACGCGGCTACTCTGTGG 
GGCGACGCTCATCGCCCCCAGATGGCTCCTGACAGCAGCCCACTGCCTCAAGCCCCGCTACA 
TAGTT CACCTGGGG CAG C AC AACCT C CAGAAGGAGGAG GG CTGTG AG C AGACCCGGACAGCC 
ACTGAGTCCTTCCCCCACCCCGGCTTCAACAACAGCCTCCCCAACAAAGACCACCGCAATGA 
CATCATGCTGGTGAAGATGGCATCGCCAGTCTCCATCACCTGGG CTGTG CGACCCCTCACCC 
TCTCCTCACGCTGTGTCACTGCTGGCACCAGCTGCCTCATTTCCGGCTGGGGCAGCACGTCC 
AGCCCCCAGTTACGCCTGCCTCACACCTTGCGATGCGCCAACATCACCATCATTGAGCACCA 
GAAGTGTGAGAACGCCTACCCCGGCAACATCACAGACACCATGGTGTGTGCCAGCGTGCAGG 
AAGGGGGCAAGGACTCCTGCCAGGGTGACTCCGGGGGCCCTCTGGTCTGTAACCAGTCTCTT 
CAAGGCATTATCTCCTGGGGCCAGGATCCGTGTGCGATCACCCGAAAGCCTGGTGTCTACAC 
GAAAGT CTG CAAAT ATGTGGACTGGATCC AGGAGACGATG AAGAAC AATTAGACTGGACC CA 
CCCACCACAGCCCATCACCCTCCATTTCCACTTGGTGTTTGGTTCCTGTTCACTCTGTTAAT 
AAGAAACCCTAAGCCAAGACCCTCTACGAACATTCTTTGGGCCTCCTGGACTACAGGAGATG 
CTGTCACTTAATAATCAACCTGGGGTTCGAAATCAGTGAGACCTGGATTCAAATTCTGCCTT 
GAAATATTGTGACTCTGGGAATGACAACACCTGGTTTGTTCTCTGTTGTATCCCCAGCCCCA 
AAGACAGCTC CTGGC C AT AT AT CAAGGTTT CAAT AAAT AT TTGC TAAATGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 102 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA65405 
<subunit 1 of 1, 250 aa, 1 stop 
<MW: 27466, pi: 8.87, NX(S/T): 4 

MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAH 
CLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFl^SLPNKDHRNDIMLVKMASPVSITW 
AVRPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTM 
VCASVQEGGKDSCQGDSGGPLVCNQSLQG 1 1 SWGQDPCAITRKPGVYTKVCKYVDW IQETMKNN 

Important features : 
Signal peptide : 

amino acids 1-18 

Serine proteases, trypsin family, histidine active site. 

amino acids 58-63 

N-glycosylation sites. 

amino acids 99-102, 165-168, 181-184, 210-213 

Glycosaminoglycan attachment site. 

amino acids 145-148 

Kringle domain proteins. 

amino acids 197-209, 47-64 

Serine proteases, trypsin family, histidine protein 

amino acids 199-209, 47-63, 220-243 

Apple domain proteins 

amino acids 222-249, 189-222 
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FIGURE 103 



GAGCAGTGTTCTGCTGGAGCCGATGCCAAAAACCATGCATTTCTTATTCAGATTCATTGTTT 
T CTTTTAT CTGTGGGGCCTTTTTACTGCTCAGAGAC AAAAGAAAGAGGAG AGCACCGAAGAA 
GTGAAAATAGAAGTTTTGCATCGTCCAGAAAACTGCTCTAAGACAAGCAAGAAGGGAGACCT 
ACTAAATGCCCATTATGACGGCTACCTGGCTAAAGACGGCTCGAAATTCTACTGCAGCCGGA 
CACAAAATGAAGGCCACCCCAAATGGTTTGTTCTTGGTGTTGGGCAAGTCATAAAAGGCCTA 
GACATTGCTATG AC AG AT ATGTGC C CTGGAGAAAAG CG AAAAGTAGTTATACCCC CTTCATT 
TGCATACG GAAAGGAAGG CT ATGC AG AAGG CAAG ATT C CAC CGGATG CTACATTGATTTTTG 
AGATTGAACTTT ATG CTGTG AC CAAAGGAC CACGGAGC ATTGAGACATTTAAACAAATAG AC 
ATGG ACAATG AC AGGC AG CT CT CT AAAG C CG AGATAAAC C TCT ACTTG CAAAGGGAATTTGA 
AAAAGATGAGAAGCCACGTGACAAGTCATATCAGGATGCAGTTTTAGAAGATATTTTTAAGA 
AG AATG AC CATG ATGGTG ATGG CT T C ATTT CT C CCAAGGAAT AC AATGTAT ACC AACACG AT 
GAACTATAGCATATTTGTATTTCTACTTTTTTTTTTTAGCTATTTACTGTACTTTATGTATA 
AAACAAAGTCACTTTTCTCCAAGTTGTATTTGCTATTTTTCCCCTATGAGAAGATATTTTGA 
TCTCCCCAATACATTGATTTTGGTATAATAAATGTGAGGCTGTTTTGCAAACTTAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 104 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA65406 
<subunit 1 of 1, 222 aa, 1 stop 
<MW: 25794, pi: 6.24, NX(S/T): 1 

MPKTMHFLFRFIVFFYLWGLFTAQRQKKEESTEEVKIEVLHRPENCSKTSKKGDLLNAHYDG 
YLAKDGSKFYCSRTQNEGHPKWFVLGVGQVIKGLDIAMTDMCPGEKRKWIPPSFAYGKEGY 
AEGKIPPDATLIFEIELYAVTKGPRSIETFKQIDMDNDRQLSKAEINLYLQREFEKDEKPRD 
KS YQDAVLED I FKKNDHDGDGF I S PKEYNVYQHDEL 

Important features: 

Endoplasmic reticulum targeting sequence. 

amino acids 219-222 

N-glycosylation site. 

amino acids 45-48 

FKBP-type pept idyl -prolyl cis- trans isomerase 

amino acids 87-223, 129-142 

EF-hand calcium- binding domain proteins 

amino acids 202-214, 195-214 
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FIGURE 105 



CAGAAATGCAGGGACCATTGCTTCTTCCAGGCCTCTGCTTTCTGCTGAGCCTCTTTGGAGCT 
GTGACTCAGAAAACCAAAACTTCCTGTGCTAAGTGCCCCCCAAATGCTTCCTGTGTCAATAA 
CACTCACTGCACCTGCAACCATGGATATACTTCTGGATCTGGGCAGAAACTATTCACATTCC 
CCTTGGAGACATGTAACGCCAGGCATGGTGGCTCGCGCCTGTAATCCCAGTTCTTTGGGAAG 
CCAAGGCAGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATAGTGAAAC 
CCCGTGTCTACTAAAAATACAAAAATCAGCCGGGCGTGGTGGTGCATGCCTGCAATCCCAGT 
TACTCGGGAGGCTGAGGCAGGAGAATCG CTTGAACT GAGGAGG CAGAAGTTGCAGTGAACCC 
AGATCCTGCCATTG GACT C CAGCATGGATG AC AG AG CAAG ACTC CGTCT CAAAAAGAAAAGA 
TAGTTTCTTGTTTCATTTCGCGACTGCCCTCTCAGTGTTTCCTGGGATCCCCTCCCAAATAA 
AGTACTTATATTCTC 
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FIGURE 106 

MQGPLLLPGLCFLLSLFGAVTQKTKTSCAKCPPNAS CVNNTHCTCNHGYTSGSGQKLFTFPL 
ETCNARHGGSRL 

Signal peptide: 

amino acids 1-18 
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FIGURE 107 

CAAGCAGGTCATCCCCTTGGTGACCTTCAAAGAGAAGCAGAGAGGGCAGAGGTGGGGGGCAC 
AGGGAAAGGGTGACCTCTGAGATTCCCCTTTTCCCCCAGACTTTGGAAGTGACCCACCATGG 
GGCTCAGCATCTTTTTGCTCCTGTGTGTTCTTGGGCTCAGCCAGGCAGCCACACCGAAGATT 
TTCAATGGCACTGAGTGTGGGCGTAACTCACAGCCGTGGCAGGTGGGGCTGTTTGAGGGCAC 
CAGCCTGCGCTGCGGGGGTGTCCTTATTGACCACAGGTGGGTCCTCACAGCGGCTCACTGCA 
GCGGCAGCAGGTACTGGGTGCGCCTGGGGGAACACAGCCTCAGCCAGCTCGACTGGACCGAG 
CAGATCCGGCACAGCGGCTTCTCTGTGACCCATCCCGGCTACCTGGGAGCCTCGACGAGCCA 
CGAGCACGACCTCCGGCTGCTGCGGCTGCGCCTGCCCGTCCGCGTAACCAGCAGCGTTCAAC 
CCCTGCCCCTGCCCAATGACTGTGCAACCGCTGGCACCGAGTGCCACGTCTCAGGCTGGGGC 
ATCACCAACCACCCACGGAACCCATTCCCGGATCTGCTCCAGTGCCTCAACCTCTCCATCGT 
CTCCCATGCCACCTGCCATGGTGTGTATCCCGGGAGAATCACGAGCAACATGGTGTGTGCAG 
GCGGCGTCCCGGGGCAGGATGCCTGCCAGGGTGATTCTGGGGGCCCCCTGGTGTGTGGGGGA 
GTCCTTCAAGGTCTGGTGTCCTGGGGGTCTGTGGGGCCCTGTGGACAAGATGGCATCCCTGG 
AGTCTACACCTATATTTGCAAGTATGTGGACTGGATCCGGATGATCATGAGGAACAACTGAC 
CTGTTTCCTCCACCTCCACCCCCACCCCTTAACTTGGGTACCCCTCTGGCCCTCAGAGCACC 
AATATCTCCTCCATCACTTCCCCTAGCTCCACTCTTGTTGGCCTGGGAACTTCTTGGAACTT 
TAACTCCTGCCAGCCCTTCTAAGACCCACGAGCGGGGTGAGAGAAGTGTGCAATAGTCTGGA 
ATAAATATAAATGAAGGAGGGGCAAAAAAAAAAAAAA 
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FIGURE 108 



MGLSIFLLLCVLGLSQAATPKIFNGTECGRNSQPWQVGLFEGTSLRCGGVLIDHRWVLTAAH 
CSGSRYWVRLGEHSLSQLDWTEQIRHSGFSVTHPGYLGASTSHEHDLRLLRLRLPVRVTSSV 
QPLPLPNDCATAGTECHVSGWGITNHPRNPFPDLLQCLNLSIVSHATCHGVYPGRITSNMVC 
AGGVPGQDACQGDSGGPLVCGGVLQGLVSWGSVGPCGQDGIPGVYTYICKYVDWIRMIMRNN 

Signal peptide: 

amino acids 1-17 
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FICTJRE 109 



GCGGCCACACGCAGCTAGCCGGAGCCCGGACCAGGCGCCTGTGCCTCCTCCTCGTCCCTCGC 
CGCGTCCGCGAAGCCTGGAGCCGGCGGGAGCCCCGCGCTCGCCATGTCGGGCGAGCTCAGCA 
ACAGGTTCCAAGGAGGGAAGGCGTTCGGCTTGCTCAAAGCCCGGCAGGAGAGGAGGCTGGCC 
GAGATCAACCGGGAGTTTCTGTGTGACCAGAAGTACAGTGATGAAGAGAACCTTCCAGAAAA 
GCTCACAGCCTTCAAAGAGAAGTACATGGAGTTTGACCTGAACAATGAAGGCGAGATTGACC 
TGATGT CTTTAAAGAGG ATGAT GG AG AAGC TTGGTGTC CC CAAG AC CC AC CTGGAGATGAAG 
AAG ATGATCTCAGAGGTG AC AGGAGGGGTCAGTG AC ACTATAT C CTACCGAGACTTTGTGAA 
CATGATGCTGGGGAAACGGTCGGCTGTCCTCAAGTTAGTCATGATGTTTGAAGGAAAAGCCA 
ACGAGAGCAGC CC CAAG C CAGTTGG C CC CC CT CC AG AG AG AGAC ATTG CTAGC CTG CCCTGA 
GGACCCCGCCTGGACTCCCCAGCCTTCCCACCCCATACCTCCCTCCCGATCTTGCTGCCCTT 
CTTG AC AC ACTGTGAT CT CT CT CT CT CTC ATTTGTTTGGT CATTGAGGGTTTGTTTGTGTTT 
TCATCAATGTCTTTGTAAAGCACAAATTATCTGCCTTAAAGGGGCTCTGGGTCGGGGAATCC 
TGAGCCTTGGGTCCCCTCCCTCTCTTCTTCCCTCCTTCCCCGCTCCCTGTGCAGAAGGGCTG 
ATATCAAACCAAAAACTAGAGGGGGCAGGGCCAGGGCAGGGAGGCTTCCAGCCTGTGTTCCC 
CTCACTTGGAGGAACCAGCACTCTCCATCCTTTCAGAAAGTCTCCAAGCCAAGTtCAGGCTC 
ACTGACCTGGCTCTGACGAGGACCCCAGGCCACTCTGAGAAGACCTTGGAGTAGGGACAAGG 
CTGCAGGGCCTCTTTCGGGTTTCCTTGGACAGTGCCATGGTTCCAGTGCTCTGGTGTCACCC 
AGGACACAGCCACTCGGGGCCCCGCTGCCCCAGCTGATCCCCACTCATTCCACACCTCTTCT 
CATCCTCAGTGATGTGAAGGTGGGAAGGAAAGGAGCTTGGCATTGGGAGCCCTTCAAGAAGG 
TACCAGAAGGAACCCTCCAGTCCTGCTCTCTGGCCACACCTGTGCAGGCAGCTGAGAGGCAG 
CGTGCAGCCCTACTGTCCCTTACTGGGGCAGCAGAGGGCTTCGGAGGCAGAAGTGAGGCCTG 
GGGTTTGGGGGGAAAGGTCAGCTCAGTGCTGTTCCACCTTTTAGGGAGGATACTGAGGGGAC 
CAGGATGGGAGAATGAGGAGTAAAATG CT C ACGG CAAAGTCAGC AGCACTGGTAAG CCAAGA 
CTGAGAAATACAAGGTTGCTTGTCTGACCCCAATCTGCTTGAAAAAAAAAAAAAAAAAA 
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FIGURE 110 

MSGELSNRFQGGKAFGLLKARQERRLAEINREFLCDQKYSDEENLPEKLTAFKEKYMEFDLN 
NEGEIDLMSLKRMMEKLGVPKTHLEMKKMISEVTGGVSDTISYRDFVNMMLGKRSAVLKLVM 
MFEGKANESSPKPVGPPPERDIASLP 
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FIGURE 111A 

CGCGCTCCCCGCGCGCCTCCTCGGGCTCCACGCGTCTTGCCCCGCAGAGGCAGCCTCCTCCA 
GGAGCGGGGCCCTGCACACCATGGCCCCCGGGTGGGCAGGGGTCGGCGCCGCCGTGCGCGCC 
CGCCTGGCGCTGGCCTTGGCGCTGGCGAGCGTCCTGAGTGGGCCTCCAGCCGTCGCCTGCCC 
CACCAAGTGTACCTGCTCCGCTGCCAGCGTGGACTGCCACGGGCTGGGCCTCCGCGCGGTTC 
CTCGGGGCATCCCCCGCAACGCTGAGCGCCTTGACCTGGACAGAAATAATATCACCAGGATC 
ACCAAGATGGACTTCGCTGGGCTCAAGAACCTCCGAGTCTTGCATCTGGAAGACAACCAGGT 
CAGCGTCATCGAGAGAGGCGCCTTCCAGGACCTGAAGCAGCTAGAGCGACTGCGCCTGAACA 
AGAATAAG CTGC AAGT CCTT C C AG AATTGC TTTT C CAGAG CA CG CCGAAGCTCACCAGACTA 
GATTTGAGTGAAAACCAGATCCAGGGGATCCCGAGGAAGGCGTTCCGCGGCATCACCGATGT 
GAAGAACCTG CAACTGGACAACAACCACATCAGC TGCATTGAAGATGGAGCCTTCCGAGCGC 
TGCGCGATTTGGAGATCCTTACCCTCAACAACAACAACATCAGTCGCATCCTGGTCACCAGC 
TTCAACCACATGCCGAAGATCCGAACTCTGCGCCTCCACTCCAACCACCTCTACTGCGACTG 
CCACCTGGCCTGGCTCTCGGATTGGCTGCGACAGCGACGGACAGTTGGCCAGTTCACACTCT 
GCATGGCTCCTGTGCATTTGAGGGGCTTCAACGTGGCGGATGTGCAGAAGAAGGAGTACGTG 
TGCCCAGCCCCCCACTCGGAGCCCCCATCCTGCAATGCCAACTCCATCTCCTGCCCTTCGCC 
CTGCACGTGCAGCAATAACATCGTGGACTGTCGAGGAAAGGGCTTGATGGAGATTCCTGCCA 
ACTTGCCGGAGGGCATCGTCGAAATACGCCTAGAACAGAACTCCATCAAAGCCATCCCTGCA 
GG AG CCTT CACCCAGTACAAGAAACTGAAG CG AATAGACAT C AG CAAG AAT CAGAT AT CGGA 
TATTGCTCCAGATGCCTTCCAGGGCCTGAAATCACTCACATCGCTGGTCCTGTATGGGAACA 
AGATCACCGAGATTGCCAAGGGACTGTTTGATGGGCTGGTGTCCCTACAGCTGCTCCTCCTC 
AATGCCAACAAGATCAACTGCCTGCGGGTGAACACGTTTCAGGACCTGCAGAACCTCAACTT 
GCTCTCCCTGTATGACAACAAGCTGCAGACCATCAGCAAGGGGCTCTTCGCCCCTCTGCAGT 
CCATCCAGACACTCCACTTAGCCCAAAACCCATTTGTGTGCGACTGCCACTTGAAGTGGCTG 
GCCGACTACCTCCAGGACAACCCCATCGAGACAAGCGGGGCCCGCTGCAGCAGCCCGCGCCG 
ACTCGCCAACAAGCGCATCAGCCAGATCAAGAGCAAGAAGTTCCGCTGCTCAGGCTCCGAGG 
ATTACCGCAGCAGGTTCAGCAGCGAGTGCTTCATGGACCTCGTGTGCCCCGAGAAGTGTCGC 
TGTGAGGGCACGATTGTGGACTGCTCCAACCAGAAGCTGGTCCGCATCCCAAGCCACCTCCC 
TGAATATGTC AC CG AC CTGCGACTGAATGACAATGAGGTATCTGTT CTGGAGGC CACTGGC A 
TCTT CAAGAAGTTG CC CAAC CTG CGG AAAATAAATCTGAGTAACAATAAGAT CAAGGAGGTG 
CG AGAGGG AG CTTT CG ATGG AG CAG CCAGCGTGCAGGAGCTGATGCTGACAGGG AACC AG CT 
GGAGACCGTGCACGGGCGCGTGTTCCGTGGCCTCAGTGGCCTCAAAACCTTGATGCTGAGGA 
GTAACTTGATCAGCTGTGTGAGTAATGACACCTTTGCCGGCCTGAGTTCGGTGAGACTGCTG 
TCCCTCTATGACAATCGGATCACCACCATCACCCCTGGGGCCTTCACCACGCTTGTCTCCCT 
GTCCACCATAAACCTCCTGTCCAACCCCTTCAACTGCAACTGCCACCTGGCCTGGCTCGGCA 
AGTGGTTGAGGAAGAGGCGGATCGTCAGTGGGAACCCTAGGTGCCAGAAGCCATTTTTCCTC 
AAGGAGATTCCCATCCAGGATGTGGCCATCCAGGACTTCACCTGTGATGGCAACGAGGAGAG 
TAGCTGCCAGCTGAGCCCGCGCTGCCCGGAGCAGTGCACCTGTATGGAGACAGTGGTGCGAT 
GCAGCAACAAGGGGCTCCGCGCCCTCCCCAGAGGCATGCCCAAGGATGTGACCGAGCTGTAC 
CTGGAAGGAAACCACCTAACAGCCGTGCCCAGAGAGCTGTCCGCCCTCCGACACCTGACGCT 
TATTGACCTGAGCAACAACAGCATCAGCATGCTGACCAATTACACCTTCAGTAACATGTCTC 
ACCTCTCCACTCTGATCCTGAGCTACAACCGGCTGAGGTGCATCCCCGTCCACGCCTTCAAC 
GGGCTGCGGTCCCTGCGAGTGCTAACCCTCCATGGCAATGACATTTCCAGCGTTCCTGAAGG 
CTCCTTCAACGACCTCACATCTCTTTCCCATCTGGCGCTGGGAACCAACCCACTCCACTGTG 
ACTGCAGTCTTCGGTGGCTGTCGGAGTGGGTGAAGGCGGGGTACAAGGAGCCTGGCATCGCC 
CGCTGCAGTAGCCCTGAGCCCATGGCTGACAGGCTCCTGCTCACCACCCCAACCCACCGCTT 
CCAGTGCAAAGGGCCAGTGGACATCAACATTGTGGCCAAATGCAATGCCTGCCTCTCCAGCC 
CGTGCAAGAATAACGGGACATGCACCCAGGACCCTGTGGAGCTGTACCGCTGTGCCTGCCCC 
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FIGURE 111B 



TACAGCTACAAGGGCAAGGACTGCACTGTGCCCATCAACACCTGCATCCAGAACCCCTGTCA 
GCATGGAGGCACCTGCCACCTGAGTGACAGCCACAAGGATGGGTTCAGCTGCTCCTGCCCTC 
TGGGCTTTGAGGGG CAGCGGTGTGAGATCAAC C C AGATGACTGTGAGG AC AACG ACTG CGAA 
AACAATGCCACCTGCGTGGACGGGATCAACAACTACGTGTGTATCTGTCCGCCTAACTACAC 
AGGTGAGCTATGCGACGAGGTGATTGACCACTGTGTGCCTGAGCTGAACCTCTGTCAGCATG 
AGGCCAAGTGCATCCCCCTGGACAAAGGATTCAGCTGCGAGTGTGTCCCTGGCTACAGCGGG 
AAGCTCTGTGAGACAGACAATGATGACTGTGTGGCCCACAAGTGCCGCCACGGGGCCCAGTG 
CGTGGACACAATCAATGGCTACACATGCACCTGCCCCCAGGGCTTCAGTGGACCCTTCTGTG 
AACACCCCCCACCCATGGTCCTACTGCAGACCAGCCCATGCGACCAGTACGAGTGCCAGAAC 
GGGGCCCAGTGCATCGTGGTGCAGCAGGAGCCCACCTGCCGCTGCCCACCAGGCTTCGCCGG 
CCCCAGATGCGAGAAGCTCATCACTGTCAACTTCGTGGGCAAAGACTCCTACGTGGAACTGG 
CCTCCGCCAAGGTCCGACCCCAGGCCAACATCTCCCTGCAGGTGGCCACTGACAAGGACAAC 
GGCATCCTTCTCTACAAAGGAGACAATGACCCCCTGGCACTGGAGCTGTACCAGGGCCACGT 
GCGGCTGGTCTATGACAGCCTGAGTTCCCCTCCAACCACAGTGTACAGTGTGGAGACAGTGA 
ATGATGGGCAGTTTCACAGTGTGGAGCTGGTGACGCTAAACCAGACCCTGAACCtAGTAGTG 
GACAAAGGAACT CC AAAG AG C CTGGGGAAGCT CC AGAAGCAGCCAGCAGT GGGC AT CAAC AG 
CCCCCTCTACCTTGGAGGCATCCCCACCTCCACCGGCCTCTCCGCCTTGCGCCAGGGCACGG 
ACCGGCCTCTAGGCGGCTTCCACGGATGCATCCATGAGGTGCGCATCAACAACGAGCTGCAG 
GACTTCAAGGCCCTCCCACCACAGTCCCTGGGGGTGTCACCAGGCTGCAAGTCCTGCACCGT 
GTGCAAGCACGGCCTGTGCCGCTCCGTGGAGAAGGACAGCGTGGTGTGCGAGTGCCGCCCAG 
GCTGGACCGGCCCACTCTGCGACCAGGAGGCCCGGGACCCCTGCCTCGGCCACAGATGCCAC 
CATGGAAAATGTGTGGCAACTGGGACCTCATACATGTGCAAGTGTGCCGAGGGCTATGGAGG 
GGACTTGTGTGACAACAAGAATGACTCTGCCAATGCCTGCTCAGCCTTCAAGTGTCACCATG 
GGCAGTGCCACATCTCAGACCAAGGGGAGCCCTACTGCCTGTGCCAGCCCGGCTTTAGCGGC 
GAGCACTGCCAACAAG AGAAT C CG TGC CTGGGAC AAGTAGTC CGAG AGGTGATCCGCCGCCA 
GAAAGGTTATGCATCATGTGCCACAGCCTCCAAGGTGCCCATCATGGAATGTCGTGGGGGCT 
GTGGGCCCCAGTGCTGCCAGCCCACCCGCAGCAAGCGGCGGAAATACGTCTTCCAGTGCACG 
GACGGCTCCTCGTTTGTAGAAGAGGTGGAGAGACACTTAGAGTGCGGCTGCCTCGCGTGTTC 
CTAAGCCCCTGCCCGCCTGCCTGCCACCTCTCGGACTCCAGCTTGATGGAGTTGGGACAGCC 
ATGTGGGACCCCCTGGTGATTCAGCATGAAGGAAATGAAGCTGGAGAGGAAGGTAAAGAAGA 
AGAGAATATTAAGTATATTGTAAAATAAACAAAAAATAGAACTTAAAAAAAAAAAAAAAAAA 
AAAAAA 
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FIGURE 112 



MAPGWAGVGAAVRARLAIiALAI^ 

AERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVL 
PE LLFQST P KLTRLDLS ENQ I QG I PRKAFRG I TD VKNLQL DNNH I S C I EDGAFRALRDLE I L 
TLNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHIiAWLSDWLRQRRTVGQFTLCMAPVHLj 
RGFNVADVQKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLMEIPANLPEGIV 
EIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIAK 
GLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHL 
AQNP FVCD CHLKWLAD YLQDNP I ETSGARCS S PRRLANKR I S Q I KSKKFRCS GS ED YRS RFS 
SECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEATGIFKKLPN 
LRKINLSNNKIKEWEGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLML^SNLISCV 
SNDTFAGLSS VRLLSL YDNR I TT I TPGAFTTL VSLST I NLLSNPFNCNCHLAWLGKWLRKRR 
IVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETWRCSNKGLR 
ALPRGMPKDVTEIiYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLtTNYTFSNMSHLSTliIL 
SYNRLRCI PVHAFNGLRSLRVLTLHGND I S SVPEGS FNDLTSLSHLALGTNPLHCDCSLRWL 
SEWVKAGYKEPGIARCSSPEPMADRLLLTTPTHRFQCKGPVDINIVAKCNACLSSPCKNNGT 
CTQDPVELYRCACPYSYKGKDCTVPINTCIQNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQR 
CEINPDDCEDNDCENNATCVDGINNYVCICPPNYTGELCDEVIDHCVPELNLCQHEAKCIPL 
DKGFSCECVPGYSGKLCETDNDDCVAHKCRHGAQCVDTINGYTCTCPQGFSGPFCEHPPPMV 
LLQTSPCDQYECQNGAQCIWQQEPTCRCPPGFAGPRCEKXjITVNFVGKDSYVEIiASAKVRP 
QANISLQVATDKDNGILLYKGDNDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHS 
VELVTLNQTLNLWDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLGGF 
HGCIHEVRINNELQDFKALPPQSLGVSPGCKSCTVCKHGLCRSVEKDSWCECRPGWTGPLC 
DQEARDPCLGHRCHHGKCVATGTSYMCKCAEGYGGDLCDNKNDSANACSAFKCHHGQCHISD 
QGEPYCLCQPGFSGEHCQQENPCLGQWREVIRRQKGYASCATASKVPIMECRGGCGPQCCQ 
PTRSKRRKYVFQCTDGSSFVEEVERHLECGCLACS 

Signal peptide: 

amino acids 1-27 
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FIGURE 113 



GG AT GCAGGACGCTCCCCTGAGCTGCCTGTCACCGACTAGGTGG AG CAGTGTTTCTTCCGC A 
GACTC AACTG AGAAGT CAGCCT CTGGGGC AGG C AC C AGGAAT CTG C CTTT T C AGTT CTGT CT 
CCGGCAGGCTTTGAGGATGAAGGCTGCGGGCATTCTGACCCTCATTGGCTGCCTGGTCACAG 
GCGCCGAGTCCAAAATCTACACTCGTTGCAAACTGGCAAAAATATTCTCGAGGGCTGGCCTG 
GACAATTACTGGGGCTTCAGCCTTGGAAACTGGATCTGCATGGCATATTATGAGAGCGGCTA 
CAACACCACAGCCCCGACGGTCCTGGATGACGGCAGCATCGACTATGGCATCTTCCAGATCA 
ACAGCTTCGCGTGGTGCAGACGCGGAAAGCTGAAGGAGAACAACCACTGCCATGTCGCCTGC 
TCAG CCTTGATC ACTGATGAC CTCAC AGATGCAATTATCTGTGCCAGGAAAATTGTTAAAGA 
G ACACAAGGAATG AACTATTGG C AAGG CTGGAAGAAAC ATTGTGAGGGCAGAG ACC TGTC CG 
AGTGGAAAAAAGGCTGTGAGGTTTCCTAAACTGGAACTGGACCCAGGATGCTTTGCAGCAAC 
GCCCTAGGATTTGCAGTGAATGTCCAAATGCCTGTGTCATCTTGTCCCGTTTCCTCCCAATA 
TT C CTT CTCAAACTTGGAGAGGGAAAATTAAGCTATACTTTT AAG AAAAT AAAT AT TTCC AT 
TTAAATGTC 
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FIGURE 114 



MKAAGILTLIGCLVTGAESKIYTRCKIiAKIFSRAGLDNYWGFSLGNWICMAYYESGYNTTAP 
TVLDDGS I DYG I FQ INSFAWCRRGKLKENNHCHVACSAL I TDDLTDAI I CARK I VKETQGMN 
YWQGWKKHCEGRDLSEWKKGCEVS 

Signal peptide: 

amino acids 1-19 



/XT/* 
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FIGURE 1X5 



CAGGCCATTTGCATCCCACTGTCCTTGTGTTCGGAGCCAGGCCACACCGTCCTCAGCAGTGT 
CATGTGTTAAAAACGCCAAGCTGAATATATCATGCCCCTATTAAAACTTGTACATGGCTCCC 
CATTGGTTTTTGGAGAAAAGTTCAAGCTTTTTACCTTGGTGTCTGCCTGTATCCCAGTGTTC 
AGGCTGGCTAGACGGCGGAAGAAGATCCTATTTTACTGTCACTTCCCAGATCTGCTTCTCAC 
CAAGAGAGATTCTTTT CTT AAACG ACTATAC AGG GC CC C AATTG ACTGGAT AGAGGAATACA 
CCACAGG CATGGCAGACTGC AT CTTAGT CAAC AGCC AGTT C ACAG CTG CTGTTTTTAAGGAA 
ACATTCAAGTCCCTGTCTCACATAGACCCTGATGTCCTCTATCCATCTCTAAATGTCACCAG 
CTTTGACTCAGTTGTTCCTGAAAAGCTGGATGACCTAGTCCCCAAGGGGAAAAAATTCCTGC 
TG CT CT C C ATCAAC AG ATACGAAAGG AAG AAAAAT CTG ACTTTGG C ACTGGAAG CCCT AGT A 
CAG CTG CGTGGAAG ATTG AC AT CC CAAG ATTGGGAG AGGGTTCAT CTG AT CGTGGCAGGTGG 
TT ATGACGAG AG AGT C CTGGAGAATGTGG AACATTATCAGG AATTGAAGAAAATGGTC CAAC 
AGTC CGACCTTGGCCAGTATGTGACCTTCTTGAGGTCTTTCT CAGACAAACAGAAAATCTCC 
CTCCTCCACAGCTGCACGTGTGTGCTTTACACACCAAGCAATGAGCACTTTGGCATTGTCCC 
TCTGGAAGCCATGTACATGCAGTGCC CAGT CATTGCTGTTAATTCGGGTGGACC CTTGGAGT 
CCATTGACCACAGTGTCACAGGGTTTCTGTGTGAGCCTGACCCGGTGCACTTCTCAGAAGCA 
ATAGAAAAGTTC ATC CGTGAAC CTTC CTTAAAAG CCAC CATGGG CCTGG CTGGAAGAGCCAG 
AGTGAAGG AAAAATTT TC CC CTGAAG CATTT ACAGAAC AGCT CT ACCGATATGTTACCAAAC 
TGCTGGT ATAA TCAGATTGTTTTTAAG AT CTCC ATT AATGT CATTTTTATGGATTGTAGACC 
CAGTTTTGAAACCAAAAAAGAAACCTAGAATCTAATGCAGAAGAGATCTTTTAAAAAATAAA 
CTTGAGTCTTGAATGTGAGCCACTTTCCTATATACCACACCTCCCTGTCCACTTTTCAGAAA 
AACCATGTCTTTTATGCTATAATCATTCCAAATTTTGCCAGTGTTAAGTTACAAATGTGGTG 
T C ATTC CATGTT CAGC AGAGTATTTTAATTATATTTTC TCGGGATTATTG CT CTTCTGTCTA 
TAAATTTTGAATGATACTGTGCCTTAATTGGTTTTCATAGTTTAAGTGTGTATCATTATCAA 
AGTTGATTAATTTGGCTTCATAGTATAATGAGAGCAGGGCTATTGTAGTTCCCAGATTCAAT 
CCACCGAAGTGTTCACTGTCATCTGTTAGGGAATTTTTGTTTGTCCTGTCTTTGCCTGGATC 
CAT AGCGAGAGTGCT CTGTATTTTTTTT AAGATAATTTGTAT TT TTGC AC ACTG AGAT AT AA 
TAAAAGGTGTTTATCATAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MPLLKLVHGS PLVFGE KFKLFTLVS AC I PVFRLARRRKKI LF YCHF PDLLLTKRDS FLKRL Y 
RAPIDWIEEYTTGMADCILVNSQFTAAVFKETFKSLSHIDPDVLYPSLNVTSFDSWPEKLD 
DLVPKGKKFLLLSINRYERKKNLTLALEALVQLRGRLTSQDWERVHLIVAGGYDERVLENVE 
HYQELKKMVQQSDLGQYVTFLRSFSDKQKISLLHSCTCVLYTPSNEHFGIVPLEAMYMQCPV 
I AVNSGGPLES I DHSVTGFL CE PD PVHFSE AI EKF I RE PS LKATMGLAGRARVKEKFS PEAF 
TEQLYRYVTKLLV 

Signal peptide: 

amino acids 1-15 
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FIGURE 117 



GACTACGCCGATCCGAGACGTGGCTCCCTGGGCGGCAGAACCATGTTGGACTTCGCGATCTT 
CGCCGTTACCTTCTTGCTGGCGTTGGTGGGAGCCGTGCTCTACCTCTATCCGGCTTCCAGAC 
AAG CTG CAGGAATTCCAGGGATTACT C C AACTGAAG AAAAAG ATGGTAAT CT TC CAGATATT 
GTGAATAGTGGAAGTTTGCATGAGTTCCTGGTTAATTTGCATGAGAGATATGGGCCTGTGGT 
CTCCTTCTGGTTTGGCAGGCGCCTCGTGGTTAGTTTGGGCACTGTTGATGTACTGAAGCAGC 
ATAT C AATCC CAAT AAGACATC GG AC CCTTTTGAAACC ATGCTG AAGT CATTATTAAGGTAT 
CAATCTGGTGGTGGCAGTGTGAGTGAAAACCACATGAGGAAAAAATTGTATGAAAATGGTGT 
GACTGATT CTCTGAAGAGTAAC TTTG CC CT CCTC CT AAAGCTTTC AGAAG AATTATTAGATA 
AATGGCTCTCCTACCCAGAGACCCAGCACGTGCCCCTCAGCCAGCATATGCTTGGTTTTGCT 
ATGAAGTCTGTT AC AC AG ATGGT AATGGGTAGT ACATTTG AAGATGAT C AGG AAGT CATT CG 
CTTCCAGAAGAATCATGGCACAGTTTGGTCTGAGATTGGAAAAGGCTTTCTAGATGGGTCAC 
TTGATAAAAACATGACTCGGAAAAAACAATATGAAGATGCCCTCATGCAACTGGAGTCTGTT 
TT AAGG AACATCATAAAAGAACGAAAAGG AAGGAACTT C AGT CAACAT ATTT TC ATTG ACT C 
CTTAGTAC AAGGGAACCTTAATGAC C AACAGATC CTAG AAGACAGT ATGATATTTTCT CTGG 
CCAGTTGCATAATAACTGCAAAATTGTGTACCTGGGCAATCTGTTTTTTAACCACCTCTGAA 
GAAGTTCAAAAAAAATTATATGAAGAGATAAACCAAGTTTTTGGAAATGGTCCTGTTACTCC 
AGAGAAAATTGAGC AG CT CAGATATTGTCAGCAT GTGCTTTGTG AAACTGTTCGAACTGC CA 
AACTGACT CCAGTTTCTG CCCAGCTTCAAGATATTGAAGGAAAAATTGACCGATTTATTATT 
CCTAGAGAGACCCTCGTCCTTTATGCCCTTGGTGTGGTACTTCAGGATCCTAATACTTGGCC 
AT CT CC AC AC AAGTTTGATCCAGATCGGTTTG ATGATGAATT AGT AATGAAAACTTTTTC CT 
CACTTGGATT CT CAGG CACAC AGGAGTGT C CAGAGTTGAGGTTTGC AT ATAT GGTGACCACA 
GTACTTCTTAGTGTATTGGTGAAGAGACTGCACCTACTTTCTGTGGAGGGACAGGTTATTGA 
AACAAAGTATGAACTGGTAACATCATCAAGGGAAGAAGCTTGGATCACTGTCTCAAAGAGAT 
ATTAAAATTTTATACATTTAAAATCATTGTTAAATTGATTGAGGAAAACAACCATTTAAAAA 
AAAT CT ATGTTGAAT CCTTTTAT AAAC CAGTATC ACTTTGTAAT ATAAAC AC CTATTTGTAC 
TTAA 
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FIGURE 118 



MLDFAI FAVTFLLALVGAVLYL YPASRQ AAG I PG I T PTEE KDGNLP D I VNSGSLHE FLVNLH 
ERYGPWSFWFGRRLWSLGTVDVLKQHINPNKTSDPFETMLKSLLRYQSGGGSVSENHMRK 
KLYENGVTDSLKSNFALLLKLSEELLDKWLSYPETQHVPLSQHMLGFAMKSVTQMVMGSTFE 
DDQEVIRFQKNHGTVWSEIGKGFLDGSLDKNMTRKKQYEDALMQLESVLRNIIKERKGRNFS 
QHIFIDSLVQGNLNDQQILEDSMIFSLASCIITAKLCTWAICFLTTSEEVQKKLYEEINQVF 
GNGPVTPEKIEQLRYCQHVLCETVRTAKLTPVSAQLQDIEGKIDRFXIPRETLVLYALGWL 
QDPNTWPSPHKFDPDRFDDELVMKTFSSLGFSGTQECPELRFAYMVTTVLLSVLVKRLHLLS 
VEGQV I ETKYELVTS S REEAW ITVS KRY 

Signal peptide: 

amino acids 1-18 

Transmembrane domain : 

amino acids 271-290 
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FIGURE 119 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGA 
GACCGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTC 
CTGACGCTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACAT 
ACAGGCCTGCCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGG 
TGGCCGCGCTCTCTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGA 
GTCTCCATGTTCAACAGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGT 
GGCCCTGTCCTTCTTCATATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCT 
TCTGCAGTGCCCTTCCAGCTGTCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAA 
AAGAAACCCTTCTGATTACCTTCATGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGG 
CCGCTTCGTATTCCTGGAAGAAGGAAGGCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTC 
TGCTGGAGGATATGTGTTGGAATAATTACGTCTTGAGTCTGGGATTATCCGCATTGTATTTA 
GTGCTTTGTAATAAAATATGTTTTGTAGTT^ACATTAAGACTTATATACAGTTTTAGGGGACA 
ATTAAAAAAAAAAAA 
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FIGURE 120 



MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLG 
LFAVELAGFLSGVSMFNSTQSL I S IGAHCS AS VALS FF I F ERWE CTTYWY I FVFCSALPAVT 
EMALFVTVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II) , 51-66, 107-124 
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FIGURE 121 



TCCCGGACCCTGCCGCCCTGCCACTATGTCCCGCCGCTCTATGCTGCTTGCCTGGGCTCTCC 
CCAGCCTCCTTCGACTCGGAGCGGCTCAGGAGACAGAAGACCCGGCCTGCTGCAGCCCCATA 
GTGCCCCGGAACGAGTGGAAGGCCCTGGCATCAGAGTGCGCCCAGCACCTGAGCCTGCCCTT 
ACGCTATGTGGTGGTATCGCACACGGCGGGCAGCAGCTGCAACACCCCCGCCTCGTGCCAGC 
AGCAGGCCCGGAATGTGCAGCACTACCACATGAAGACACTGGGCTGGTGCGACGTGGGCTAC 
AACTTCCTGATTGGAGAAGACGGGCTCGTATACGAGGGCCGTGGCTGGAACTTCACGGGTGC 
CCACTCAGGTCACTTATGGAACCCCATGTCCATTGGCATCAGCTTCATGGGCAACTACATGG 
ATCGGGTGCCCACACCCCAGGCCATCCGGGCAGCCCAGGGTCTACTGGCCTGCGGTGTGGCT 
CAGGGAGCCCTGAGGTCCAACTATGTGCTCAAAGGACACCGGGATGTGCAGCGTACACTCTC 
TCCAGGCAACCAGCTCTACCACCTCATCCAGAATTGGCCACACTACCGCTCCCCCTGAGGCC 
CTGCTGATCCGCACCCCATTCCTCCCCTCCCATGGCCAAAAACCCCACTGTCTCCTTCTCCA 
ATAAAGATGTAGCTC 
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FIGURE 122 



MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECAQHLSLPLRYVWSHT 
AGSSCOTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYEGRGWNFTGAHSGHLWNP 
MSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVLKGHRDVQRTLSPGNQLYHL 
IQNWPHYRSP 

Signal peptide: 

amino acids 1-2 0 
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FIGURE 123 

CTGGGACC CCGAAAAG AGAAGGGGAGAGCG AGGGGACG AG AG CGGAGG AGGAAG ATGCAACT 
GACTCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGG 
ATGATGGTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCC 
CGGGTGCCTCGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCT 
CCTAGGGCTGCTGGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCC 
CGAACCACAGCCCCCCACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTAC 
TCCAACATCAAGACGGTGGCCCTGAAGCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAA 
TGGGAC CTTCAG CGT C CACTT C CAAC AC AATG CC AC AGGC CAGGGAAACAT CT CCATCAGC C 
TCGTGCCCCCCAGTAAAGCTGTAGAGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAG 
GCCTCCAAAATCTTCAACTGCCGGATGGAGTGGGAGAAGGTAGAACGGGGCCGCQGGACCTC 
GCTTTGCACCCACGACCCAGCCAAGATCTGCTCCCGAGACCACGCTCAGAGCTCAGCCACCT 
GGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTGTCTACATCGCCTTCTACAGCACGGACTAT 
CGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC CATAGTGATACCCC CTACTACCCATC 
TGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGCTGGAAGGACAGGCCTGCCCATGC 
AGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTCAGGCAGGGAGGGGGGTGGAG 
ACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAGAGAAAGGGTCCCAAGTG 
CTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGGAGGAGGAGTGGGCT 
CTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGGTCCCCGAGGCC 
TGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGCCCTTGGTT 
CTTG CC AT CC TG AGGAAAGATAGC AACAGGGAGGGGGAGATTT CAT CAGTGTGGACAGCCTG 
TCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGGGC 
CAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATG 
GCATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTC 
CAGCCAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATT 
GCTGATGGCACACCCATCCTTAAGCTAAGACAGG ACGATTGTGGTC CT CC CACACTAAGGCC 
ACAGCCCATCCGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAG 
CATCCATGTCCCGGAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCC 
CGGATCTGGATGGCGCCGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGA 
GCATGTGCTGGATCTGTTCTGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGT 
GAAACCGCTGATTGCTGACTTTTGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTT 
GCCCCGGGGCA 
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FIGURE 124 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA66521 
xsubunit 1 of 1, 252 aa, 1 stop 
><MW: 28127, pi: 8.91, NX(S/T): 5 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMAN 
STLLGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVD 
HGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGR 
RTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDYNYHSDTPY 
YPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE 125 



GT GAATGTGAGGGTTTGATGACTTT CAG ATGT CT AGGAAC CAGAGTGGGTGCAGGGG C CC CA 
GGCAGGGCTGATTCTTGGGCGGAGGAGAGTAGGGTAAAGGGTTCTGCATGAGCTCCTTAAAG 
GACAAAGGTAACAGAGCCAG CGAG AG AG CT CGAGGGGAG ACTTTG ACTTCAAG C C ACAGAAT 
TGGTGGAAGTGTGCGCGCCGCCGCCGCCGTCGCTCCTGCAGCGCTGTCGACCTAGCCGCTAG 
CATCTTCCCGAGCACCGGGATCCCGGGGTAGGAGGCGACGCGGGCGAGCACCAGCGCCAGCC 
GGCTGCGGCTGCCCACACGGCTCACCATGGGCTCCGGGCGCCGGGCGCTGTCCGCGGTGCCG 
GCCGTGCTGCTGGTCCTCACGCTGCCGGGGCTGCCCGTCTGGGCACAGAACGACACGGAGCC 
CATCGTGCTGGAGGGCAAGTGTCTGGTGGTGTGCGACTCGAACCCGGCCACGGACTCCAAGG 
GCTCCTCTTCCTCCCCGCTGGGGATATCGGTCCGGGCGGCCAACTCCAAGGTCGCCTTCTCG 
GCGGTGCGGAGCAC CAACCACGAGCC AT CCGAGATGAGCAACAAGACGCGCATCATTTACTT 
CGATCAGATCCTGGTGAATGTGGGTAATTTTTTCACATTGGAGTCTGTCTTTGTAGCACCAA 
GAAAAG GAAT TTAC AGTTT C AGTTTT C ACGTGATTAAAGTCT ACCAGAGCCAAACT ATCCAG 
GTTAACTTGATGTTAAATGGAAAACCAGTAATATCTGCCTTTGCGGGGGACAAAGATGTTAC 
TCGTGAAGCTGCCACGAATGGTGTCCTGCTCTACCTAGATAAAGAGGATAAGGTTTACCTAA 
AACTGGAGAAAGGTAATTTGGTTGGAGGCTGGCAGTATTCCACGTTTTCTGGCTTTCTGGTG 
TTCCCC CTATAGGATT CAATTTCT CCATGATGTTCATCCAGGTGAGGGATGACC CACTCCTG 
AGTTATTGGAAGATCATTTTTTCATCATTGGATTGATGTCTTTTATTGGTTTCTCATGGGTG 
GATATGGATTCTAAGGATTCTAGCCTGTCTGAACCAATACAAAATTTCACAGATTATTTGTG 
TGTGTCTGTTTCAGTATATTTGGATTGGGACTCTAAGCAGATAATACCTATGCTTAAATGTA 
ACAGTCAAAAGCTGTCTGCAAGACTTATTCTGAATTTCATTTCCTGGGATTACTGAATTAGT 
TACAGATGTGGAATTTTATTTGTTTAGTTTTAAAAGACTGGCAACCAGGTCTAAGGATTAGA 
AAACTCTAAAGTTCTGACTTCAATCAACGGTTAGTGTGATACTGCCAAAGAACTGTATACTG 
TGTTAATATATTGATTATATTTGTTTTTATTCCTTTGGAATTAGTTTGTTTGGTTCTTGTAA 
AAAACTTGGATTTTTTTTTTCAGTAACTGGTATTATGTTTTCTCTTAAAATAAGGTAATGAA 
TGGCTTGCCCACAAATTTACCTTGACTACGATATCATCGACATGACTTCTCTCAAAAAAAAA 
GAATGCTTCATAGTTGTATTTTAATTGTATATGTGAAAGAGTCATATTTTCCAAGTTATATT 
TTCTAAGAAGAAGAATAGATCATAAATCTGACAAGGAAAAAGTTGCTTACCCAAAATCTAAG 
TGCTCAATCCCTGAGCCTCAGCAAAACAGCTCCCCTCCGAGGGAAATCTTATACTTTATTGC 
TCAACTTT AATTAAAATG ATTG AT AAT AAC CACTTTATTAAAAAC CTAAGGTTTTTTTTTTT 
TC CGTAGACATGAC CACT TT ATTAACTGGTGGTGGGATGCTGTTGTTTCTAATT ATAC CTAT 
TTTTCAAGGCTTCTGTTGTATTTGAAGTATCATCTGGTTTTGCCTTAACTCTTTAAATTGTA 
TATATTTATCTGTTTAGCTAATATTAAATTCAAATATCCCATATCTAAATTTAGTGCAATAT 
CT TGTCTTTTGTAT AGGT CATATG AATTCATAAAATTATT T ATGTCTGTT ATAG AATAAAG A 
TTAATATATGTTAAAAAAA 
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FIGURE 126 



MGSGRRALSAVPAVLLVLTLPGLP VWAQNDTE P I VLEGKCLWCDSNPATDS KGSS SS PLG I 
SVRAANSKVAFSAVRSTNHEPSEMSNKTRIIYFDQILVNVGNFFTLESVFVAPRKGIYSFSF 
HVIKVYQSQTIQVNLMLNGKPVISAFAGDKDVTREAATNGVLLYLDKEDKVYLKLEKGNLVG 
GWQYSTFSGFLVFPL 

Signal peptide: 

amino acids 1-2 7 
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FIGURE 12 7 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTC 
GCCCTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGC 
TTTCTTCTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTA 
TTGACAACAAAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTC 
TATATCCAAGAAATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTT 
GAAGAGTATAAACC CAGGTGAGACAG CACC CTCTATGCGACTGCTGGC CTATGTTTCTGGCT 
TGGGCTTTGGAATCATGAGTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGG 
CCAGGCACAGTGGGCATTCATGGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGAC 
GCTGGTCATTATCTTGCTGCATGTATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGA 
AAAAGTGGGGCATCCTCCTTATCGTTCTCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTC 
AT AAGTTCTT AT T ATGGAATAAAC CTGGCGT CAGCATTTATAAT CCTGGTG C TC ATGGGC AC 
CTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCGAAGCCTGAAACTCTGCCTGCTCTGCCAAG 
ACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGATAACCTCAGGGAACCAGCACTTCCCAA 
ACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCTTTTTCTGAAAATCCCTTTTTCTG 
GTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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></usr/seqdb2/sst/DNA/Dnaseqs . min/ ss ♦ DNA66658 
xsubunit 1 of 1, 257 aa, 1 stop 
><MW: 28472, pi: 9.33, NX(S/T): 0 

MTAAVFFGCAFIAFGPALALYVFTIAIEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDN 
KDGPTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGF 
GIMSGVFSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKW 
GILLIVLLTHLLVSAQTFISSYYGINI^SAFXILVLMGTWAFLAAGGSCRSLKIjCLLCQD^ 
FLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Gly cos aminoglyc an attachment site. 

amino acids 120-123 

Sodiununeurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 129 



CGGCAACCAGCCGCCGCCACCACCGCTGCCACTGCCGCCCTGCCGGGGCCATGTTCGCTCTGGGCTTGCCCTTCT 
TGGTGCTCTTGGTGGCCTCGGTCGAGAGCCATCTGGGGGTTCTGGGGCCCAAGAACGTCTCGCAGAAAGACGCCG 
AGTTTGAGCGCACCTACGTGGACGAGGTCAACAGCGAGCTGGTCAACATCTACACCTTCAACCATACTGTGACCC 
GCAACAGGACAGAGGGCGTGCGTGTGTCTGTGAACGTCCTGAACAAGCAGAAGGGGGCGCCGTTGCTGTTTGTGG 
TCCGCC^GAAGGAGGCTGTGGTGTCCTTCCAGGTGCCCCTAATCCTGCGAGGGATGTTTC^GCGCAAGTACCTCT 
ACCAAAAAGTGGAACGAACCCTGTGTCAGCCCCCCACCAAGAATGAGTCGGAGATTCAGTTCTTCTACGTGGATG 
TGTCCACCCTGTCACC^GTCAACACCACATACCAGCTCCGGGTCAGCCGCATGGACGATTTTGTGCTCAGGACTG 
GGGAGCAGTTCAGCTTCAATACCACAGCAGCACAGCCCCAGTACTTCAAGTATGAGTTCCCTGAAGGCGTGGACT 
CGGTAATTGTCAAGGTGACCTCCAACAAGGCCTTCCCCTGCTCAGTCATCTCCATTCAGGATGTGCTGTGTCCTG 
TCTATGACCTGGACAACAACGTAGCCTTCATCGGCATGTACCAGACGATGACCAAGAAGGCGGCCATCACCGTAC 
AGCGCAAAGACTTCCCCAGCAACAGCTTTTATGTGGTGGTGGTGGTGAAGACCGAAGACCAAGCCTGCGGGGGCT 
CCCTGCCTTTCTACCCCTTCGCAGAAGATGAACCGGTCGATCAAGGGCACCGCCAGAAAACCCTGTCAGTGCTGG 
TGTCTCAAGCAGTCACGTCTGAGGCATACGTCAGTGGGATGCTCTTTTGCCTGGGTATATTTCTCTCCTTTTACC 
TGCTGACCGTCCTCCTGGCCTGCTGGGAGAACTGGAGGCAGAAGAAGAAGACCCTGCTGGTGGCCATTGACCGAG 
CCTGCCCAGAAAGCGGTCACCCTCGAGTCCTGGCTGATTCTTT^ 

ATGGCTCCTTTGAGAATGTTTCTGGATCTACCGATGGTCTGGTTGACAGCGCTGGCACTGGGGACCTCTCTTACG 
GTTACCAGGGCCGCTCCTTTGAACCTGTAGGTACTCGGCCCCGAGTGGACTCCATGAGCTCTGTGGAGGAGGATG 
ACTACGACACATTGACCGACATCGATTCCGACAAGAATGTCATTCGCACCAAGCAATACCTCTATGTGGCTGACC 
TGGCACGGAAGGACAAGCGTGTTCTGCGGAAAAAGTACCAGATCTACTTCTGGAACATTGCCACCATTGCTGTCT 
TCTATGCCCTTCCTGTGGTGCAGCTGGTGATCACCTACCAGACGGTGGTGAATGTCACAGGGAATCAGGACATCT 
GCTACTAGAACTTCCTCTGCGCCCACCCACTGGGG^^ 

ACATCCTGCTGGGGCTGCTTTTCCTGCTCATCATCCTGC^CGGGAGATCJVACCACAACCGGGCCCTGCTGCGCA 
ATGACCTCTGTGCCCTGGAATGTGGGATCCCCAAACACTTTGGGCTTTTCTACGCCATGGGCACAGCCCTGATGA 
TGGAGGGGCTGCTGAGTGCTTGCTATCATGTGTGCCCCAACTATA 

ACATGATCGCCGGACTCTGCATGCTGAAGCTCTACCAGAAGCGGCACCCGGACATCAACGCCAGCGCCTACAGTG 
CCTACGCCTGCCTGGCCATTGTCATCTTCTTCTCTGTGCTGGGCGTGGTCTTTGGCAAAGGGAACACGGCGTTCT 
GGATCGTCTTCTCCATCATTCACATCATCGCCACCCTGCTCCTCAGCACGCAGCTCTATTACATGGGCCGGTGGA 
AACTGGACTCGGGGATCTTCCGCCGCATCCTCCACGTGCTCTACACAGACTGCATCCGGCAGTGCAGCGGGCCGC 
TCTACGTGGACCGCATGGTGCTGCTGGTCATGGGCAACGTCATCAACTGGTCGCTGGCTGCCTATGGGCTTATCA 
TGCGCCCCAATGATTTCGCTTCCTACTTGTTGGCCATTGGCATCTGCAACCTGCTCCTTTACTTCGCCTTCTACA 
TCATCATGAAGCTCCGGAGTGGGGAGAGGATCAAGCTCATCCCCCTGCTCTGCATCGTTTGCACCTCCGTGGTCT 
GGGGCTTCGCGCTCTTCTTCTTCTTCCAGGGACTCAGCACCTGGCAGAAAACCCCTGCAGAGTCGAGGGAGCACA 
ACCGGGACTGCATCCTCCTCGACTTCTTTGACGACCACGACA^ 

GGTCCTTCCTGGTGTTGCTGACAgTGGATGACGACCTGGATACTGTGCAGCGGGACAAGATCTATGTCTTCTAGC 

GTGGGGATGAGTCCCAGCACCGCTGCCCAGCACTGGATGGCAGCAGGACAGCCAGGTCTAGCTTAGG 

GGGACAGCCATGGGGTGGCATGGAACCTTGCAGCTGCCCTCTGCCGAGGAGCAGGCCTGCTCCCCTGGAACCCCC 

AGATGTTGGCCAT^TTGCTGCTTTCTTCTCAGTGTTGGGGCCTTCCATGGGCCCCTGTCCTTTGGCTCTCCATTT 

GTCCCTTTGCAAGAGGAAGGATGGAAGGGACACCCTCCCCATTTCATGCCTTGCATTTTGCCCGTCCTCCTCCCC 

ACAATGCCCCAGCCTGGGACCTAAGGCCTCTTTTTCCTCCCATACTCCCACTCCAGGGCCTAGTCTGGGGCCTGA 

ATCTCTGTCCTGTATCAGGGCCCCAGTTCTCTTTGGGCTGTCCCTGGCTGCCATCACTGCCCATTCCAGTCAGCC 

AGGATGGATGGGGGTATGAGATTTTGGGGGTTGGCCAGCTGGTGCCAGACTTTTGGTGCTAAGGCCTGCAAGGGG 

CCTGGGGCAGTGCGTATTCTCTTCCCTCTGACCTGTGCTCAGGGCTGGCTCTTTAGCAATGCGCTCAGCCCAATT 

TGAGAACCGCCTTCTGATTCAAGAGGCTGAATTCAGAGGTCACCTCTTCATCCCATCAGCTCCCA 

AGCACC^GGACTGGAGGGAGAAGCGCCTC^CCCCTTCCCTTCCTTCTTTCCAGGCCCTTAGTCTTGCCAAACCCC 

AGCTGGTGGCCTTTCAGTGCCATTGACACTGCCCAAGAATGTCCAGGGGCAAAGGAGGGATGATACAGAGTTCAG 

CCCGTTCTGCCTC CACAGCTGTGGGCAC CCCAGTGCCTACCTTAGAAAGGGGCTTCAGGAAGGGATGTGCTG TTT 

CCCTCTACGTGCCCAGTCCTAGCCTCGCTCTAGGACCCAGGGCTGGCTTCTAAGTTTCCGTCO^GTCTTCAGGCA 

AGTTCTGTGTTAGTCATGCACACACATACCTATGAAACCTTGGAGTTTACAAAGAATTGCCCCAGCTCT 

CCTGGCCACCCTGGTCCTTGGATCCCCTTCGTCCCACCTGGTCCACCCCAGATGCrGAGGATGGGGGAGCTCAGG 

CGGGGCCTCTGCTTTGGGGATGGGAATGTGTTTTTCTCCCAAACTTGTTTTTATAGCTCTGCTTGAAGGGCTGGG 

AGATGAGGTGGGTCTGGATCTTTTCTCAGAGCGTCTCCATGCTATGGTTGCATTTCCGTTTTCTATGAATGAATT 

T G CATT C AAT AAAC AAC C AG AC T CAAAAAAAAAAAAAAA 
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FIGURE 130 



></usr/seqdb2/sst /DNA/Dnaseqs . min/ss ♦ DNA66659 
xsubunit 1 of 1, 832 aa, 1 stop 
><MW: 94454, pi: 6.94, NX(S/T): 12 

MFALGLPFLVLLVASVESHLGVLGPKNVSQKDAEFERTYVDEVNSELVNIYTFNHTVTRNRT 
EGWVSVNVLNKQKGAPLLFWRQKEAWSFQVPLILRGMFQRKYLYQKVERTLCQPPTKNE 
SEIQFFYVDVSTLSPVNTTYQLRVSRMDDFVLRTGEQFSFNTTAAQEQYFKYEFPEGVDSVI 
VKVTSNKAFPCSVISIQDVLCPVYDLDNNVAFIGMYQTMTKKAAITVQRKDFPSNSFYVVW 
VKTEDQACGGSLPFYPFAEDEPVDQGHRQKTLSVLVSQAVTSEAYVSGMLFCLGIFLSFYLL 
TVLLACWENWRQKKKTLLVAIDRACPESGHPRVLADSFPGSSPYEGYNYGSFENVSGSTDGL 
VDS AGTGDLS YG YQGRS FE P VGTRPRVDSMS S VEEDD YDTLTD I DS DKNV I RTKQYLYVADL 
ARKDKRVLRKKYQ I YFWNI AT I AVF YALP WQLV I T YQTWNVTGNQD I C YYNFLCAHPLGN 
LSAFNNILSNLGYILLGLLFLLIILQREINHNRALLRNDLCALECGIPKHFGLFYAMGTALM 
MEGLLSACYHVCPNYTNFQFDTSFMYMIAGLCMLKLYQKRHPDINASAYSAYACLAIVIFFS 
VLGWFGKGNTAFWIVFS 1 1 HI IATLLLSTQLYYMGRWKLDSGIFRRILHVLYTDCIRQCSG 
PLYVDRMVIiL VMGNVI NWS LAAYGL I MRPNDFAS YLLA I GICNLLLYFAFYIIMKLRSGERI 
KLIPLLCIVCTSVVWGFALFFFFQGLSTWQKTPAESREHNRDCILLDFFDDHDIWHFIiSSIA 
MFGSFLVLLTLDDDLDTVQRDKIYVF 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

Transmembrane domains: 

amino acids 292-317, 451-470, 501-520, 607-627, 751-770 

Leucine zipper pattern. 

amino acids 497-518 

N-glycosylation sites. 

amino acids 27-30, 54-57, 60-63, 123-126, 141-144, 165-168, 364- 
367, 476-479, 496-499, 572-575, 603-606, 699-702 
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FIGURE 131 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGC 
TCTCTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTG 
ACCATGGTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGC 
CCAGCCTGCAGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACC 
TGACCAAGTTGCCGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCA 
GGCAAGGCAACTGAGGGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAG 
GGCCCTGGACCGAGAGGAGCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATG 
GACATGTCTTGTGGGGTCCACAGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTG 
CCCCATTTCTCTCAAGCCATCTACAGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCC 
CTTCCTCTTCCTTGAGGCTTCAGACCGGGATGAGCCAGGCACAGCCAACTCGGATCTTCGAT 
TCCACATCCTGAGCCAGGCTCCAGCCCAGCCTTCCCCAGACATGTTCCAGCTGGAGCCTCGG 
CTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGCACCAGCCTTGACCACGCCCTGGAGAGGAC 
CTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGACCAGGCCTCAGGCCACCAGGCCACTG 
CCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGTCCCTAGAGCCTATCCACCTGGCA 
GAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTACACTGGAGTGGG6GTGATGT 
GC ACTATCAC CTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAATGCAGAGGGAAAC CT CT 
ACGTGAC C AG AG AGCTGG ACAGAGAAGC C CAGGCTGAGTAC CTG CT CC AGGTGCGGGCTC AG 
AATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTGATGGATGAGAA 
TGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCTCAGTCCAC 
CAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCAATTCC 
CACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTCCA 
GGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACA 
TCCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGT 
GAAGTCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGAT 
TGGGCCTATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCA 
TTGATGCTGACCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACA 
GAAGGGACTTTTGGCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAA 
GAAC CT CAGTTATG AGG CAG CT C C AAGT CATGAGGTGGTGGTGGTGGTG CAG AGTGTGGCGA 
AG CTGGTGGGG C CAGG C C CAGG C C CTGGAGC CACCG CCACGGTGACTGTG CT AGTGGAGAGA 
GTGATGCCACCCCCCAAGTTGGACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCC 
AGCCGGCTCTTTCCTGCTGACCATCCAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCT 
CCCTAGTCAATGACTCAGAGGGCTGGCTCTGCATTGAGAAATTCTCCGGGGAGGTGCACACC 
GCCCAGTCCCTGCAGGGCGCCCAGCCTGGGGACACCTACACGGTGCTTGTGGAGGCCCAGGA 
TACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAATACCTCTGCACACCCCGCCAAGACCATG 
GCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCTGGCCAGTGGGCACGGTCCCTACAGC 
TTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGCGCCTCCAGACTCTCAATGGTTC 
CCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGTGAACACATAATCCCCGTGG 
TGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGATCGTGTGTCGCTGCAAC 
GTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCACGAAGCTGTCGGC 
AGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTCATTTTCACCC 
ACTGG AC C ATGT CAAGG AAG AAGG AC C CGGATCAAC CAGCAGACAG CGTGCCCCTGAAGGCG 
ACTGTCTgAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAGTCCC 
CTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCAT 
CTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAAC 
TTTATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCA 
GAGAACTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 132 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSG 
KATEGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVP 
HFSQAIYRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRL 
GALALS PKGSTSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS 1 1 ESTWVSLEPIHLAE 
NLKVLYPHHMAQVHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQN 
SHGEDYAAPLELHVLVMDENDNVP I CPPRDPTVS I PELSPPGTEVTRLSAEDADAPGSPNSH 
VVYQLLSPEPEDGVEGRAFQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCE 
VE VAVTD I NDHAPE F I TS Q I GP I SLP EDVEPGTLVAMLTA I DADLE PAFRLMDF AI ERGDTE 
GTFGLDWEPDSGHVRLRLCK^LSYEAAPSHEVVVWQSVAKIjVGPGPGPGATATVTVLVERV 
MPPPKLDQESYEASVPISAPAGSFLLTIQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTA 
QSLQGAQPGDTYTVLVEAQDTALTLAPVPSQYLCTPRQDHGLIVSGPSKDPDLASGHGPYSF 
TLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEPREHI I PVWSHNAQMWQLLVRVI VCRCNV 
EGQC^RKVGRMKGMPTKLSAVGILVGTLVAIGIFLILIFTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 




WO 00/12708 



PCT/US99/20111 



FIGURE 133 



CCGGGGACATGAGGTGGATACTGTTCATTGGGGCCCTTATTGGGTCCAGCATCTGTGGCCAA 
GAAAAATTTTTTGGGGACC AAGTTTTGAGGATT AATGT C AGAAATGGAGACG AG AT CAG C AA 
ATTGAGTCAACT AGTGAATTC AAACAAC TTGAAG CT CAATTT CTGG AAAT CT C C CT CCTC CT 
TCAATCGGCCTGTGGATGTCCTGGTCCCATCTGTCAGTCTGCAGGCATTTAAATCCTTCCTG 
AGATCCCAGGGCTTAGAGTACGCAGTGACAATTGAGGACCTG CAGGCC CTTTTAGACAATGA 
AGATGATGAAATGCAACACAATGAAGGG CAAG AACGGAGC AGT AAT AACTT C AACT ACGGGG 
CTTACCATTCCCTGGAAGCTATTTACCACGAGATGGACAACATTGCCGCAGACTTTCCTGAC 
CTGGCGAGGAGGGTGAAGATTGGACATTCGTTTGAAAACCGGCCGATGTATGTACTGAAGTT 
CAGCACTGGGAAAGGCGTGAGGCGGCCGGCCGTTTGGCTGAATGCAGGCATCCATTCCCGAG 
AGTGGATCTCCCAGGCCACTGCAATCTGGACGGCAAGGAAGATTGTATCTGATTACCAGAGG 
GATCCAGCTATCACCTCCATCTTGGAGAAAATGGATATTTTCTTGTTGCCTGTGGCCAATCC 
TG ATGGATATGT GT AT AC TC AAACT C AAAACCGATTATGGAGGAAGACGCGGT C C C GAAATC 
CTGGAAGCTCCTGCATTGGTGCTGACCCAAATAGAAACTGGAACGCTAGTTTTGCAGGAAAG 
GGAGCCAGCGACAACCCTTGCTCCGAAGTGTACCATGGACCCCACGCCAATTCGGAAGTGGA 
GGTGAAATCAGTGGTAGATTTCATCCAAAAACATGGGAATTTCAAGGGCTTCATCGACCTGC 
ACAGCTACTCGCAGCTGCTGATGTATCCATATGGGTACTCAGTCAAAAAGGCCCCAGATGCC 
GAGGAACTCGACAAGGTGGCGAGGCTTGCGGCCAAAGCTCTGGCTTCTGTGTCGGGCACTGA 
GTACCAAGTGGGTCCCACCTGCACCACTGTCTATCCAGCTAGCGGGAGCAGCATCGACTGGG 
CGTATGACAACGGCATCAAATTTGCATTCACATTTGAGTTGAGAGATACCGGGACCTATGGC 
TTCCTC CTGCCAGCT AACCAGATC AT CC CC ACTG CAGAGGAG ACGTGG CTGGGG CT GAAG AC 
CATCATGGAGCATGTGCGGGACAACCTCTAC TAG GCGATGGCTCTGCTCTGTCTACATTTAT 
TTGTACCCACACGTGCACGCACTGAGGCCATTGTTT^AAGGAGCTCTTTCCTACCTGTGTGAG 
TCAGAGCCCTCTGGGTTTGTGGAGCACACAGGCCTGCCCCTCTCCAGCCAGCTCCCTGGAGT 
CGTGTGTCCTGGCGGTGTCCCTGCAAGAACTGGTTCTGCCAGCCTGCTCAATTTTGGTCCTG 
CTGTTTTTGATGAGCCTTTTGTCTGTTTCTCCTTCCACCCTGCTGGCTGGGCGGCTGCACTC 
AGCATCACCCCTTCCTGGGTGGCATGTCTCTCTCTACCTCATTTTTAGAACCAAAGAACATC 
TGAGATGATTCTCTACCCTCATCCACATCTAGCCAAGCCAGTGACCTTGCTCTGGTGGCACT 
GTGGGAGACACCACTTGTCTTTAGGTGGGTCTCAAAGATGATGTAGAATTTCCTTTAATTTC 
TCGCAGTCTTCCTGGAAAATATTTTCCTTTGAGCAGCAAATCTTGTAGGGATATCAGTGAAG 
GTCTCTCCCTCCCTCCTCTCCTGTTTTTTTTTTTTTTGAGACAGAGTTTTGCTCTTGTTGCC 
CAGGCTGGAGTGTGATGGCTCGATCTTGGCTCACCACAACCTCTGCCTCCTGGGTTCAAGCA 
ATTCTCCTGCCTCAGCCTCTTGAGTAGCTTGGTTTATAGGCGCATGCCACCATGCCTGGCTA 
ATTTTGTGTTTTTAGTAGAGACAGGGTTTCTCCATGTTGGTCAGGCTGGTCTCAAACTCCCA 
ACCTCAGGTGATCTGCCCTCCTTGGCCTCCCAGAGTGCTGGGATTACAGGTGTGAGCCACTG 
TGCCGGGCCCGTCCCCTCCTTTTTTAGGCCTGAATACAAAGTAGAAGATCACTTTCCTTCAC 
TGTGCTGAGAATTTCTAGATACTACAGTTCTTACTCCTCTCTTCCCTTTGTTATTCAGTGTG 
ACCAGGATGGCGGGAGGGGATCTGTGTCACTGTAGGTACTGTGCCCAGGAAGGCTGGGTGAA 
GTGACCATCTAAATTGCAGGATGGTGAAATTATCCCCATCTGTCCTAATGGGCTTACCTCCT 
CTTTGCCTTTTGAACTCACTTCAAAGATCTAGGCCTCATCTTACAGGTCCTAAATCACTCAT 
CTGGCCTGGATAATCTCACTGCCCTGGCACATTCCCATTTGTGCTGTGGTGTATCCTGTGTT 
TCCTTGTCCTGGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTGTGTGTGTGTG 
TCTGTCTATTTTGTATC CTGGACC AC AAGT TCCTAAGTAGAG CAAGAATTCAT CAAC CAGCT 
GCCTCTTGTTTCATTTCACCTCAGCACGTACCATCTGTCCTTTTGTTGTTGTTGTTTTGTTT 
TTGTTTTTTTGCTTTTACCAAACATGTCTGTAAATCTTAACCTCCTGCCTAGGATTTGTACA 
GCATCTGGTGTGTGCTTATAAGCCAATAAATATTCAATGTGAAAAAAAAAAAAAAAAA 
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FIGURE 134 



MRWILFIGALIGSSICGQEKFFGDQVLRINVRNGDEISKLSQLVNSNNLKLNFWKSPSSFNR 
PVDVLVPSVSLQAFKSFLRSQGLEYAVTIEDLQALLDNEDDEMQHNEGQERSSNNFNYGAYH 
SLEAIYHEMDNIAADFPDLARRVKIGHSFENRPMYVLKFSTGKGVRRPAVWLNAGIHSREWI 
SQATAIWTARKIVSDYQRDPAITSILEKMDIFLLPVANPDGYVYTQTQNRLWRKTRSRNPGS 
SCI GAD PNRNWN AS F AGKGAS DNP C S E VYHGPHANS E VEVKS WDF I QKHGNF KG F I DLHS Y 
SQLLMYPYGYSVKKAPDAEELDKVARLAAKALASVSGTEYQVGPTCTTVYPASGSSIDWAYD 
NG I KFAFTFELRDTGTYGFLLPANQ 1 1 PTAEETWLGLKT I MEHVRDNL Y 

Signal peptide : 

amino acids 1-16 
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FIGURK 135 



C AACCATG CAAGGAC AGGGCAGGAGAAGAGG AAC CTG CAAAG ACATATTTTGTTC C AAAATG 

GCATCTTACCTTTATGGAGTACTCTTTGCTGTTGGCCTCTGTGCTCCAATCTACTGTGTGTC 

CCCGGCCAATGCCCCCAGTGCATACCCCCGCCCTTCCTCCACAAAGAGCACCCCTGCCTCAC 

AGGTGTATTCCCTCAACACCGACTTTGCCTTCCGCCTATACCGCAGGCTGGTTTTGGAGACC 

CCGAGTCAGAACATCTTCTTCTCCCCTGTGAGTGTCTCCACTTCCCTGGCCATGCTCTCCCT 

TGGGGCCCACTCAGTCACCAAGACCCAGATTCTCCAGGGCCTGGGCTTCAACCTCACACACA 

CACCAGAGTCTGCCATCCACCAGGGCTTCCAGCACCTGGTTCACTCACTGACTGTTCCCAGC 

AAAGACCTGACCTTGAAGATGGGAAGTGCCCTCTTCGTCAAGAAGGAGCTGCAGCTGCAGGC 

AAATTTCTTGGGCAATGTCAAGAGGCTGTATGAAGCAGAAGTCTTTTCTACAGATTTCTCCA 

AC CC CTCCATTG C C CAGG CGAGGAT CAACAGCCATGTGAAAAAGAAGACC CAAGGGAAGGTT 

GTAGACATAATCCAAGGCCTTGACCTTCTGACGGCCATGGTTCTGGTGAATCACATTTTCTT 

TAAAGCCAAGTGGGAGAAGCCCTTTCACCTTGAATATACAAGAAAGAACTTCCCATTCCTGG 

TGGGCGAGCAGGTCACTGTGCAAGTCCCCATGATGCACCAGAAAGAGCAGTTCGCTTTTGGG 

GTGGATACAGAGCTGAACTGCTTTGTGCTG CAGATGGATT AC AAGGGAGATG CCGTGGC C TT 

CTTTGTCCTCCCTAGCAAGGGCAAGATGAGGCAACTGGAACAGGCCTTGTCAGCCAGAACAC 

TGATAAAGTGGAGCCACTCACTCCAGAAAAGGTGGATAGAGGTGTTCATCCCCAGATTTTCC 

ATTTCTGCCTCCTACAATCTGGAAACCATCCTCCCGAAGATGGGCATCCAAAATGCCTTTGA 

C7UVAAATGCTGATTTTTCTGGAATTGCAAAGAGAGACTCCCTGCAGGTTTCTAAAGCAACCC 

AC AAGG CTGTGCTGGATGT C AG TG AAGAGGGC ACTGAGG C CACAGCAGCTACC ACCAC CAAG 

TTCATAGTCCGATCGAAGGATGGTCCCTCTTACTTCACTGTCTCCTTCAATAGGACCTTCCT 

GATGAT GATT AC AAAT AAAG CCACAGACGGT ATTCTCTTTCTAGGG AAAGTGGAAAAT CC CA 

CTAAATCCTAGGTGGGAAATGGCCTGTTAACTGATGGCACATTGCTAATGCACAAGAAATAA 

CAAACCACATCCCTCTTTCTGTTCTGAGGGTGCATTTGACCCCAGTGGAGCTGGATTCGCTG 

GCAGGGATGCCACTTCCAAGGCTCAATCACCAAACCATCAACAGGGACCCCAGTCACAAGCC 

AACACC C ATTAACCCCAGTCAGTGCCCTTTT C CACAAATT CT CC CAGGTAACTAG CTT CATG 

GGATGTTGCTGGGTTACCATATTTCCATTCCTTGGGGCTCCCAGGAATGGAAATACGCCAAC 

CCAGGTTAGGCACCTCTATTGCAGAATTACAATAACACATTCAATAAAACTAAAATATGAAT 

TCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAA 
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FIGURE 136 



MASYLYGVLFAVGLCAPIYC7SPANAPSAYPRPSSTKSTPASQVYSLNTDFAFRLYRRLVLE 
TPSQNIFFSPVSVSTSLAMLSLGAHSVTKTQILQGLGFNLTHTPESAIHQGFQHLVHSLTVP 
SKDLTLKMGSALFVKKELQLQANFLGNVKRLYEAEVFSTDFSNPSIAQARINSHVKKKTQGK 
VVDIIQGLDLLTAMVLVNHIFFKAKWEKPFHLEYTRKNFPFLVGEQVTVQVPMMHQKEQFAF 
GVDTELNCFVLQMD YKGDAVAFFVLPSKGKMRQLEQALSARTL I KWSHSLQKRWIEVF I PRF 
S I S AS YNLET I L PKMG I QNAFDKNADFSG I AKRD S LQ VS KATHKAVLDVSEEGTEATAATTT 
KF I VRSKDGP S YFTVS FNRTFLMM I TNKATDG I LFLGKVENPTKS 

Signal peptide: 

amino acids 1-20 



/37 /3- 7C> 
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FIGURE 137 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGC 

CTGAGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAG 

CCTTTATCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAG 

TCTTGGTACAT CTAGGACCCAGGC ATCTTG CTTT CC AGCC ACAAAG AG AC AGATGAAGATG C 

AGAAAGGAAATGTTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCC 

AATGAGACTAG CACCT CTGC CAAC ACTGGATC C AGTGTGAT CTC CAGTGG AG C CAGCACAGC 

CACCAACTCTGGGTCCAGTGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCA 

GCGTGAC CTC CAATGGGGTC AG CATAGT C AC CAACTCTGAGTT C CATACAAC CT CCAGTGGG 

ATCAGCACAGCCACCAACTCTGAGTTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAA 

CTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAC 

CCTCCAGTGGGGCCAGCACAGTCACCAACTCTGGGTCCAGTGTGACCTCCAGTGGAGCCAGC 

ACTGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACTGCCACCAACTCTGA 

GTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCA 

GTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCC 

ACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACTGCCACCAACTCTGAGTCCAG 

CACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGAACGACCTCCAATGGGG 

CTGGCAC AG C CACCAACT CTGAGT CC AGCACG AC CT CC AGTGGGGC CAGC AC AG CCAC CAAC 

TCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACGAC 

CTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCTAGCA 

CAG CCAC CAACTCTGACT CC AG CACAAC CT CC AGTGGGGC CGGC AC AG CC AC CAACTCTGAG 

TCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCCAG 

TGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCA 

CCAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGC 

ACAACCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGC 

TAGC AC AGC CAC CAACTCTGACT C C AGC AC AAC CT C C AGTGAGG CCAGCACAGCCAC CAACT 

CTGAGT CT AGCACAGTGT C C AGTG GGATCAGC AC AG TCACCAATTCTGAGTCCAGCACAACC 

TCCAGTGGGGCCAACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAAC 

AGCAGCTCTGACTGGAATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAA 

AGCCTGGTGGGTCCCTGGTGCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCG 

GCCGTGGGGCTCTTTGCTGGGCTCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACAC 

CTTTAACACAGCTGTCTACCACCCTCATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAG 

GGAATC ATGGAG CCCCC CACAGGCCCAGGTGGAGTC CT AACTGG TT CTGGAGGAGACCAGT A 

TC AT CG AT AG CC ATGG AG ATGAGCGGGAGGAACAGCGGGC C CTGAGC AG CC CCGG AAGC AAG 

TGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCACCCAAGACCTGGTTTCCTTTCATTCATC 

CC AGGAGAC CCCTC CCAGCT TTGTTTGAGATC CTGAAAAT CTTGAAGAAGGT ATTC CT C ACC 

TTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATATTGCTCATTTAGCTAAGAAATAA 

ATACATCTCATCTAACACACACGACAAAGAGAAGCTGTGCTTGCCCCGGGGTGGGTATCTAG 

CTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTCCATCTGGCATTCAAAA 

TCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAA 
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FIGURE 138 



MKMQ KGNVLLMFGL LLHLEAATNSNETS T S ANTG SSVISS GAST ATNS GS S VTS S G VS TAT I 
SGSSVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSE 
SSTPSSGASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSS 
TTS SGASTATNS ES STTS SGASTATNSE SSTVS SRASTATNSES STTS SGASTATNSESRTT 
SNGAGTATNSESSTTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSS 
GASTATNSDSSTTSSGAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGA 
NTATNSESSTVS SGASTATNSE SS TTS SGVSTATNSESSTTSSGASTATNSDSSTTSS EAST 
ATNSESSTVSSGISTVTNSESSTTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAV 
SEAKPGGSLVPWEIFLITLVSWAAVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGP 
GPGGNHGAPHRPRWSPNWFWRRPVSSIAMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain : 

amino acids 510-532 
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FIGURE 139 



GGGAGAGAGGATAAATAGCAGCGTGGCTTCCCTGGCTCCTCTCTGCATCCTTCCCGACCTTC 
CCAGCAATATGCATCTTGCACGTCTGGTCGGCTCCTGCTCCCTCCTTCTGCTACTGGGGGCC 
CTGTCTGGATGGGCGGCCAGCGATGACCCCATTGAGAAGGTCATTGAAGGGATCAACCGAGG 
GCTGAGCAATGCAGAGAGAGAGGTGGGCAAGGCCCTGGATGGCATCAACAGTGGAATCACGC 
ATGCCGGAAGGGAAGTGGAGAAGGTTTTCAACGGACTTAGCAACATGGGGAGCCACACCGGC 
AAGGAGTTGGACAAAGGCGTCCAGGGGCTCAACCACGGCATGGACAAGGTTGCCCATGAGAT 
CAACCATGGTATTGGACAAGCAGGAAAGGAAGCAGAGAAGCTTGGCCATGGGGTCAACAACG 
CTGCTGGACAGGCCGGGAAGGAAGCAGACAAAGCGGTCCAAGGGTTCCACACTGGGGTCCAC 
CAGGCTGGGAAGGAAGCAGAGAAACTTGGCCAAGGGGTCAACCATGCTGCTGACCAGGCTGG 
AAAGGAAGTGGAGAAGCTTGGCCAAGGTGCCCACCATGCTGCTGGCCAGGCCGGGAAGGAGC 
TGCAGAATGCTCATAATGGGGTCAACCAAGCCAGCAAGGAGGCCAACCAGCTGCTGAATGGC 
AACCATCAAAGCGGATCTTCCAGCCATCAAGGAGGGGCCACAACCACGCCGTTAGCCTCTGG 
GGCCTCAGTCAACACGCCTTTCATCAACCTTCCCGCCCTGTGGAGGAGCGTCGCCAACATCA 
TGCCCTAAACTGGCATCCGGCCTTGCTGGGAGAATAATGTCGCCGTTGTCACATCAGCTGAC 
ATGACCTGGAGGGGTTGGGGGTGGGGGACAGGTTTCTGAAATCCCTGAAGGGGGTTGTACTG 
GGATTTGTGAATAAACTTGATACACCA 
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FIGURE 140 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA66675 
xsubunit 1 of 1, 247 aa, 1 stop 
><MW: 25335, pi: 7.00, NX(S/T): 0 

MHLARLVGS CSLLLLLGALSGWAASDDP I E KVI EG INRGLSNAERE VGKALDG INSG ITHAG 
REVEKVFNGLSNMGSHTGKELDKGVQGLNHGMDKVAHE INHG I GQAGKEAEKLGHGVNNAAG 
QAGKEADKAVQGFHTGVHQAGKEAEKLGQGVNHAADQAGKEVEKLGQGAHHAAGQAGKELQN 
AHNGVNQASKEANQLLNGNHQSGSSSHQGGATTTPI^SGASVNTPFINLPALWRSVANIMP 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

Homologous region to circumsporozoite (CS) repeats: 

amino acids 35-225 
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FIGURK 141 



CTCCGGGTCCCCAGGGGCTGCGCCGGGCCGGCCTGGCAAGGGGGACGAGTCAGTGGACACTCCAGGAAGAGCGGC 

CCCGCGGGGGGCGATGACCGTGCGCTGACCCTGACTCACTCCAGGTCCGGAGGCGGGGGCCCCCGGGGCGACTCG 

GGGGCGGACCGCGGGGCGGAGCTGCCGCCCGTGAGTCCGGCCGAGCCACCTGAGCCCGAGCCGCGGGACACCGTC 

GCTCCTGCTCTCCGAATGCTGCGCACCGCGATGGGCCTGAGGAGCTGGCTCGCCGCCCCATGGGGCGCGCTGCCG 

CCTCGGCCACCGCTGCTGCTGCTCCTGCTGCTGCTGCTCCTGCTGCAGCCGCCGCCTCCGACCTGGGCGCTCAGC 

CCCCGGATCAGCCTGCCTCTGGGCTCTGAAGAGCGGCCATTCCTCAGATTCGAAGCTGAACACATCTCCAACTAC 

ACAGCCCTTCTGCTGAGCAGGGATGGCAGGACCCTGTACGTGGGTGCTCGAGAGGCCCTCTTTGCACTCAGTAGC 

AACCTCAGCTTCCTGCCAGGCGGGGAGTACCAGGAGCTGCTTTGGGGTGCAGACGCAGAGAAGAAACAGCAGTGC 

AGCTTCAAGGGCAAGGACCCACAGCGCGACTGTCAAAACTACATCAAGATCCTCCTGCCGCTCAGCGGCAGTCAC 

CTGTTCA.CCTGTGGCACAGCAGCCTTCAGCCCCATGTGTACCTACATCAACATGGAGAACTTCACCCTGGCAAGG 

GACGAGAAGGGGAATGTCCTCCTGGAAGATGGCAAGGGCCGTTGTCCCTTCGACCCGAATTTCAAGTCCACTGCC 

CTGGTGGTTGATGGCGAGCTCTACACTGGAACAGTCAGCAGCTTCCAAGGGAATGACCCGGCCATCTCGCGGAGC 
CAAAGCCTTCGCCCCACCAAGACCGAGAGCTCCCT 

ATTCCTGAGAGCCTGGGCAGCTTGCAAGGCGATGATGACAAGATCTACTTTTTCTTCAGCGAGACTGGCCAGGAA 
TTTGAGTTCTTTGAGAACACCATTGTGTCCCGCATTGCCCGCATCTGCAAGGGCGATGAGGGTGGAGAGCGGGTG 
CTACAGCAGCGCTGGACCTCCTTCCTCAAGGCCCAGCTGCTGTGCTC^CGGCCCGACGATGGCTTCCCCTTCAAC 
GTGCTGCAGGATGTCTTCACGCTGAGCCCCAGCCCCCAGGACTGGCGTGACA.CCCTTTTCTATGGGGTCTTCACT 
TCCCAGTGGCACACX^GAACTACAGAAGGCTCTGCCGTCTGTGTCTTCACAATGAAGGATGTGCAQAGAGTCTTC 
AGCGGCCTCTACAAGGAGGTGAACCGTGAGACACAGCAGTGGTACACCGTGACCCACCCGGTGCCCACACCCCGG 
CCTGGAGCGTGCATCACCAACAGTGCCCGGGAAAGGAAGATCAACTCATCCCTGCAGCTCCCAGACCGCGTGKrrG 
AACTTCCTCAAGGACCACTTCCTGATGGACGGGCAGGTCCGAAGCCGCATGCT 

TACCAGCGCGTGGCTGTACACCGCGTCCCTGGCCTGCACCACACCTACGATGTCCTCTTCCTGGGCACTGGTGAC 

GGCCGGCTCCACAAGGCAGTGAGCGTGGGCCCCCGGGTGCACATCATTGAGGAGCTGCAGATCTTCTCATCGGGA 

C^GCCCGTGCAGAATCTGCTCCTGGACACCCACAGGGGGCTGCTGTATGCGGCCTCACACTCGGGCGTAGTCCAG 

GTGCCCATGGCCAACTGCAGCCTGTACCGGAGCTGTGGGGACTGCCTCCTCGCCCGGGACCCCTACTOTGCTTGG 

AGCGGCTCCAGCTGCAAGCACGTCAGCCTCTACCAGCCTCAGCTGGCCACCAGGCCGTGGATCCAGGACATCGAG 

GGAGCCAGCGCCAAGGACCTTTGCAGCGCGTCTTCGGTTGTGTCCCCGTCTTTTGTACCAACAGGGGAGAAGCCA 

TGTGAGCAAGTCCAGTTCCAGCCCAACACAGTGAACACTTTGGCCTGCCCGCTCCTCTCCAACCTGGCGACCCGA 

CTCTGGCTACGKIAACGGGGCCCCCGTCAATGCCrCGGCCTCCTGCCACGTGCTACCCACTGGGGACCTGCTGCTG 

GTGGGCACCCAACAGCTGGGGGAGTTCCAGTGCTGGTCACTAGAGGAGGGCTTCCAGCAGCTGGTAGCCAGCTAC 

TGCCCAGAGGTGGTGGAGGACGGGGTGGCAGACCAAACAGATGAGGGTGGCAGTGTACCCGTC^TTATCAGCACA 

TCGCGTGTGAGTGCACCAGCTGGTGGCAAGGCCAGCTGGGGTGCAGACAGGTCCTACTGGAAGGAGTTCCTGGTG 

ATGTGCACGCTCTTTGTGCTGGCCGTGCTGCTCCCAGTTTTATTCTTGCTCTACCGGCACCGGAACAGCATGAAA 

GTCTTCCTGAAGCAGGGGGAATGTGCX!AGCGTGCACCCCAAGACCTGCCCTGTGGTGCTGCCCCCTGAGACCCGC 

CCACTCAACGGCCTAGGGCCCCCTAGCACCCCGCTCGATCACCGAGGGTACCAGTCCCTGTCAGACAGCCCCCCG 

GGGGCCCGAGTCTTCACTGAGTCAGAGAAGAGGCCACTCAGCATCCAAGACAGCTTCGTGGAGGTATCCCCAGTG 

TGCCCCCGGCCCCGGGTCCGCCTTGGCTCGGAGATCCGTGACTCTGTGGTGTGAGAGCTGACTTCCAGAGGACGC 

TGCCCTGGCTTCAGGGGCTGTGAATGCTCGGAGAGGGTCAACTGGACCTCCCCTCCGCTCTGCTCTTCGTGGAAC 

ACGACCGTGGTGCCCGGCCCTTGGGAGCCTTGGAGCCAGCTGGCCTGCTGCTCTCCAGTCAAGTAGCGAAGCTCC 

TACCACCCAGACACCCAAACTlGCCGTGGCCCCAGAGGTCCTGGCCAAATATGGGGGCCTGCCrAGGTTGGTGGAA 

CAGTGCTCCTTATGTAAACTGAGCCCTTTGTTTAAAAAACAATTCCAAATGTGAAACTAGAATGAGAGGGAAGAG 

ATAGCATGGCATGCAGCACACACGGCTGCTCCAGTTCATGGCCTCCCAGGGGTGCTGGGGATGCATCCAAAGTGG 

TTGTCTGAGACAGAGTTGGAAACCCTCACCAACTGGCCTCTTCACCTTCCACATTATCCCGCTGCCACCGGCTGC 

CCTGTCTC^CTGCAGATTCAGGACCTVGCTTGGGCTGCGTGCGTTCTGCCTTGCCAGTCAGCCGAGGATGTAGTTG 

TTGCTGCCGTCGTCCCACCACCTCAGGGACCAGAGGGCTAGGTTGGCACTGCGGCCCTCACCAGGTCCTGGGCTC 

GGACCCAACTCCTGGACCTTTCCAGCCTGTATCAGGCTGTGGCCACACGAGAGGACAGCGCGAGCTCAGGAGAGA 

TTTCGTGACAATGTACGCCTTTCCCTCAGAATTCAGGGAAGAGACTGTCGCCTGCCTTCCTCCGTTGTTGCGTGA 

GAACCCGTGTGCCCCTTCCCACCATATCCACCCTCGCTCCATCTTTGAACTCAAACACGAGGAACTAACTGCACC 

CTGGTCCTCTCCCCAGTCCCCAGTTCACCCTCCATCCCTCACCTTCCTCCACTCTAAGGGATATCAACACTGCCC 

AGCACAGGGGCCCTGAATTTATGTGGTTTTTATACATTTTTTAATAAGATGCACTTTATGTCATTTTTTAATAAA 

GTCTGAAGAATTACTGTTTAAAAAAAAAAAA 
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FIGURE 142 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA67962 
xsubunit 1 of 1, 837 aa, 1 stop 
><MW: 92750, pi: 7.04, NX(S/T): 6 

MLRTAMGLRSWLAAPWGALPPRPPLLLLLLLLLLLQPPPPTWALSPRISLPLGSEERPFLRF 

EAEHISNYTALLLSRDGRTLYVGAREALFALSSNLSFLPGGEYQELLWGADAEKKQQCSFKG 

KDPQRDCQNYIKILLPLSGSHLFTCGTAAFSPMCTYINMENFTLARDEKGNVLLEDGKGRCP 

FDPNFKSTALWDGELYTGTVSSFQGNDPAISRSQSLRPTKTESSLNWLQDPAFVASAYIPE 

SLGSLQGDDDKIYFFFSETGQEFEFFENTIVSRIARICKGDEGGERVLQQRWTSFLKAQLLC 

SRPDDGFPFNVLQDVFTLSPSPQDWRDTLFYGVFTSQWHRGTTEGSAVCVFTMKDVQRVFSG 

LYKEVNRETQQWYTVTHPVPTPRPGACITNSARERKINSSLQLPDRVLNFLKDHFLMDGQVR 

SRMLLLQPQARYQRVAVHRVPGLHHTYDVLFLGTGDGRLHKAVSVGPRVHIIEELQIFSSGQ 

PVQNLLLDTHRGLLYAASHSGWQVPMANCSLYRSCGDCLLARDPYCAWSGSSCKHVSLYQP 

QLATRPWIQDIEGASAKDLCSASSWSPSFVPTGEKPCEQVQFQPNTVNTLACPLLSNLATR 

LWLRNGAPVNASASCHVLPTGDLLLVGTQQLGEFQCWSLEEGFQQLVASYCPEWEDGVADQ 

TDEGGSVPVIISTSRVSAPAGGKASWGADRSYWKEFLVMCTLFVLAVLLPVLFLLYRHRNSM 

KVFLKQGECASVHPKTCPWLPPETRPLNGLGPPSTPLDHRGYQSLSDSPPGARVFTESEKR 
PLSIQDSFVEVSPVCPRPRVRLGSEIRDSW 

Transmembrane domains : 

amino acids 23-46 (type II), 718-738 
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FIGURE 143A 

CTAAGCCGGAGGATGTGCAGCTGCGGCGGCGGCGCCGGCTACGAAGAGGACGGGGACAGGCGCCGTGCGAACCGA 

GCCCAGCCAGCCGGAGGACGCGGGCAGGGCGGGACGGGAGCCCGGACTCGTCTGCCGCCGCCGTCGTCGCCGTCG 

TGCCGGCCCCGCGTCCCCGCGCGCGAGCGGGAGGAGCCGCCGCCACCTCGCGCCCGAGCCGCCGCTAGCGCGCGC 

CGGGCATGGTCCCCTCTTAAAGGCGCAGGCCGCGGCGGCGGGGGCGGGTGTGCGGAACAAAGCGCCGGCGCGGGG 

CCTGCGGGCGGCTCGGGGGCCGCGATGGGCGCGGCGGGCCCGCGGCGGCGGCGGCGCTGCCCGGGCCGGGCCTCG 

CGGCGCTAGGGCGGGCTGGCCTCCGTGGGCGGGGGCAGCGGGCTGAGGGCGCGCGGAGCCTGCGGCGGCGGCGGC 

GGCGGCGGCGGCGGCCCGGCGGGCGGAGCGGCGCGGGCASGGCCGCGCGCGGCCGGCGCGCCTGGCTCAGCGTGC 

TGCTCGGGCTCGTCCTGGGCTTCGTGCTGGCCTCGCGGCTCGTCCTGCCCCGGGCTTCCGAGCTGAAGCGAGCGG 

GCCCACGGCGCCGCGCCAGCCCCGAGGGCTGCCGGTCCGGGCAGGCX3GCGGCTTCCCAGGCCGGCGGGGCGCGCG 

GCGATGCGCGCGGGGCGCAGCTeTGGCCGCCCGGCTCGGACCCAGATGGCGGCCCGCGCGACAGGAACTTTCTCT 

TCGTGGGAGTCATGACCGCCCAGAAATACCTGCAGACTCGGGCCGTGGCCGCCTACAGAACATGGTCCAAGACAA 

TTCCTGGGAAAGTTCAGTTCTTCTCAAGTGAGGGTTCTGACACATCTGTACCAATTCCAGTAGTGCCACTACGGG 

GTGTGGACGACTCCTACCCGCCCCAGAAGAAGTCCTTCATGATGCTCAAGTACATGCACGACCACTACTTGGACA 

AGTATGAATGGTTTATGAGAGCAGATGATGACGTGTACATCAAAGGAGACCGTCTGGAGAACTTCCTGAGGAGTT 

TGAACAGCAGCGAGCCCCTCTTTCTTGGGCAGACAGGCCTGGGCACCACGGAAGAAATGGGAAAACTGGCCCTGG 

AGCCTGGTGAGAACTTCTGCATGGGGGGGCCTGGCGTGATCATGAGCCGGGAGGTGCTTCGGAGAATGGTGCCGC 

ACATTGGCAAGTGTCTCCGGGAGATGTACACCACCCATGAGGACGTGGAGGTGGGAAGGTGTGTCGGGAGGTTTG 

CAGGGGTGCAGTGTGTCTGGTCTTATGAGATGCGGCAGCTTTTTTATGAGAATTACGAGCAGAACAAAAAGGGGT 

ACATTAGAGATCTCCATAACAGTAAAATTCACCAAGCTATCACATTACACCCCAACAAAAACCCACCCTACCAGT 

ACAGGCTCCACAGCTACATGCTGAGCCGCAAGATATCCGAGCTCCGCCATCGCACAATACAGCTGCACCGCGAAA 

TTGTCCTGATGAGCAAATACAGCAACACAGAAATTCATAAAGAGGACCTCCAGCTGGGAATCCCTCCCTCCTTCA 

TGAGGTTTCAGCCCCGCCAGCGAGAGGAGATTCTGGAATGGGAGTTTCTGACTGGAAAATACTTGTATTCGGCAG 

TTGACGGCCAGCCCCCTCGAAGAGGAATGGACTCCGCCCAGAGGGAAGCCTTGGACGACATTGTCATGCAGGTCA 

TGGAGATGATCAATGCCAACGCCAAGACCAGAGGGCGCATCATTGACTTCAAAGAGATCCAGTACGGCTACCGCC 

GGGTGAACCCCATGTATGGGGCTGAGTACATCCTGGACCTGCTGCTTCTGTACAAAAAGCACAAAGGGAAGAAAA 

TGACGGTCCCTGTGAGGAGGCACGCGTATTTACAGCAGACTTTCAGOUUUVTCCAGTTTGTGGAGCATGAGGAGC 

TGGATGCACAAGAGTTGGCCAAGAGAATCAATCAGGAATCTGGATCCTTGTCCTTTCTCTCAAACTCCCTGAAGA 

AGCTCGTCCCCTTTCAGCTCCCTGGGTCGAAGAGTGAGCACAAAGAACCCAAAGATAAAAAGATAAACATACTGA 

TTCCTTTGTCTGGGCGTTTCGACATGTTTGTGAGATTTATGGGAAACTTTGAGAAGACGTGTCTTATCCCCAATC 

AGAACGTCAAGCTCGTGGTTCTGCTTTTCAATTCTG1ACTCCAACCCTGACAAGGCCAAACAAGTTGAACTGATGA 

GAGATTACCGC^TTAAGTACCCTAAAGCCGACATGCAGATTTTGCCTGTGTCTGGAGAGTrTTCAAGAGCCCTGG 

CCCTGGAAGTAGGATCCTCCC^GTTTAACAATGAATCTTTGCTCTTCTTCTGCGACGTCGACCTCGTGTTTACTA 

CAGAATTCCTTCAGCGATGTCGAGCAAATACAGTTCTGGGCCAACAAATATATTTTCCAATCATCTTCAGCCAGT 

ATGACCCAAAGATTGTTTATAGTGGGAAAGTTCCCAGTGACAACCATTTTGCCTTTACTCAGAAAACTGGCTTCT 

GGAGAAACTATGGGTTTGGCATCACGTGTATTTATAAGGGAGATCTTGTCCGAGTGGGTGGCTTTGATGTTTCCA 

TCCaUVGGCTGGGGGCTGGAGGATGTGGACCTTTTCAACAAGGTTGTCCAGGCAGGTTTGAAGACGTTTAGGAGCC 

AGGAAGTAGGAGTAGTCCACGTCCACCATCCTGTCTTTTGTGATCCCAATCTTGACCCCAAACAGTACAAAATGT 

GCTTGGGGTCCAAAGCATCS3ACCTATGGGTCCACCCAGCAGCTGGCTGAGATGTGGCTGGAAAAAAATGATCCAA 

GTTACAGTAAAAGC AGCAAT AATAATGGCTCAGTGAGGACAGCCTAATGTCCAGCTTTGCTGGAAAAGACGTTTT 

TAATTATCTAATTTATTTTTCAAAAATTTTTTGTATGATCAGTTTTTGAAGTCCGTATACAAGGATATATTTTAC 

AAGTGGTTTTCTTACATAGGACTCCTTTAAGATTGAGCTTTCTGAACAAGAAGGTGATCAGTGTTTGCCTTTGAA 

CAC^TCTTCTTGCTGAACATTATGTAGCAGACCTGCTTAACTTTGACTTGAAATGTACCTGATGAACAAAACTTT 

TTTAAAAAAATGTTTTCTTTTGAGACCCTTTGCTCCAGTCCTATGGCAGAAAACGTGAACATTCCTGCAAAGTAT 

TA " GTAACA !t!^ CTCT ^ 

GTTTTGTTTTTTTTTTTTACAATTGTTTTAAAGCCATTTCATGTTCCAGTTGTAAGATAAGGAAATGTGATAATA 
GCTGTTTCATCATTGTCTTCAGGAGAGCTTTCCAGAGTTGATCATTTCCTCTCATGGTACTCTGCTCAGCATGGC 
CACGTAGGTTTTTTGTTTGTTTTGTTTTGTTCTTTTTTTGAGACGGAGTCTCACTCTGTTACCCAGGCTGGAATG 
CAGTGGCGCAATCTTGGCTCACTTTAACCTCCACTTCCCTGGTTCAAGCAATTCCCCTGCCTTTGCCTCCCGAGT 
AGCTGGGATTACAGGCACACACCACCACGCCCAGNTAGTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCAT 
GCAAGCCCAGCTGGCCACGTAGGTTTTAAAGCAAGGGGOSTGAAGAAGGCACAGTGAGGTATGTGGCTGTTCTCG 
TGGTAGTTCATTCGGCCTAAATAGACCTGGCATTAAATTTCAAGAAGGATTTGGCATTTTCTCTTCTTGACCCTT 
CTCTTTAAAGGGTAAAATATTAATGTTTAGAATGACAAAGATGAATTATTACAATAAATCTGATGTACACAGACT 
GAAACATACACACATACACCCTAATCAAAACGTTGGGGAAAAATGTATTTGGTTTTGTTCCTTTCATCCTGTCTG 
TGTTATGTGGGTGGAGATGGTTTTCATTCTTTCATTACTGTTTTGTTTTATCCTTTGTATCTGAAATACCTTTAA 

tttatttaatatctgttgttCagagctctgccatttcttgagtacctgttagttagtattatttatgtgtatcgg 

GAGTGTGTTTAGTCTGTTTTATTTGCAGTAAACCGATCTCCAAAGATTTCCTTTTGGAAACGCTTTTTCCCCTCC 
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FIGURE 143B 



TTAATTTTTATATTCCTTACTGTTTTAOTAAATATTAAGTGTTCTTTGACAATTTTGGTGCTCATGTGTTTTGGG 
GACAAAAGTGAAATGAATCTGTC3ATTATACCAGAAAGTTAAATTCTCAGATCAAATGTGCCTTAATAAATTTGTT 
TTCATTTAGATTTCAAACAGTGATAGACTTGC 

CCTGATGCTCATTTGGAAATU^TAAACCAGTGAACAATATTTTTCTATTGTACTTTTCGAACCATTTTGTCTCATT 
ATTCCTGTTTTAGCTGAAGAATTGTATTACATTTGGAGAGTAAA7UVACTTAAACACGAAAAAA 
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FIGURE 144 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA68836 
xsubunit 1 of 1, 802 aa, 1 stop 
><MW: 91812, pi: 9.52, NX(S/T): 3 

MAARGRRAWLSVLLGLVLGFVLASRLVLPRASELKRAGPRRRASPEGCRSGQAAASQAGGAR 
GDARGAQLWPPGSDPDGGPRDRNFLFVGVMTAQKYLQTRAVAAYRTWSKTIPGKVQFFSSEG 
SDTSVPIPWPLRGVDDSYPPQKKSFMMLKYMHDHYLDKYEWFMRADDDVYIKGDRLENFLR 
SLNSSEPLFLGQTGLGTTEEMGKIjALEPGENFCMGGPGVIMSREVLRRJWPHIGKCLREMCT 
THEDVEVGRCVRRFAGVQCVWSYEMRQLFYENYEQNKKGYIRDLHNSKIHQAITLHPNKNPP 
YQYRLHSYMLSRKISELRHRTIQLHREIVLMSKYSNTEIHKEDLQLGIPPSFMRFQPRQREE 
I LEWEFL.TGK YL YS AVDGQ P PRRGMDS AQREALDD I VMQVMEMI NANAKTRGRI IDFKE I Q Y 
G YRRVNPMYGAE Y I LDLLLL, YKKHKGKKMT VP VRRHAYLQQT FS K I QF VEHEELDAQELAKR 
INQESGSLSFLSNSLKKLVPFQLPGSKSEHKEPKDKKINILIPLSGRFDMFVRFMGNFEKTC 
L I PNQNVKL WLLFNSDSNP DKAKQVELMRDYR I KYPKADMQ I L PVS GE FSRALALEVGS SQ 
FNNESLLFFCDVDLVFTTEFLQRCRANTVLGQQIYFPIIFSQYDPKIVYSGKVPSDNHFAFT 
QKTGFWRNYGFG I TC I YKGDLVRVGGFDVS IQGWGLEDVDLFNKWQAGLKTFRSQEVGWH 
VHHPVFCDPNLDPKQYKMCLGSKASTYGSTQQ 

Signal peptide: 

amino acids 1-23 
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GGACAACCGTTGCTGGGTGTCCCAGGGCCTGAGGCAGGACGGTACTCCGCTGACACCTTCCC 
TTTCGGCCTTGAGGTTCCCAGCCTGGTGGCCCCAGGACGTTCCGGTCGCATGGCAGAGTGCT 
ACGGACGACGCCTATCAAGCCCTTAGTCCTTCTAGTTGCGCTTTTGCTATGGCCTTCGTCTG 
TGCCGGCTTATCCGAGCATAACTGTGACACCTGATGAAGAGCAAAACTTGAATCATTATATA 
CAAGTTTTAGAGAACCTAGTACGAAGTGTTCCCTCTGGGGAGCCAGGTCGTGAGAAAAAATC 
TAACTCTCCAAAACATGTTTATTCTATAGCATCAAAGGGATCAAAATTTAAGGAGCTAGTTA 
CACATGGAGACG CTTC AACTG AG AATGATGTTTT AAC C AAT C CTATCAGTGAAGAAACTACA 
ACTTTCCCTACAGGAGGCTTCACACCGGAAATAGGAAAGAAAAAACACACGGAAAGTACCCC 
ATT CTG GT CGAT CAAACC AAACAATG TTTC CATTGTTTTGCATGCAGAGGAACCTTAT ATTG 
AAAATGAAGAGCCAGAGCCAGAGCCGGAGCCAGCTGCAAAACAAACTGAGGCACCAAGAATG 
TTGCCAGTTGTTACTGAATCATCTACAAGTCCATATGTTACCTCATACAAGTCACCTGTCAC 
C ACTTT AG ATAAGAGCACTGGCATTGAGATCT CTAC AG AATC AGAAGATGTT CCT C AG CT CT 
CAGGTGAAACTGCGATAG AAAAACC CGAAGAGTTTGGAAAGC AC CC AG AGAGTTGGAATAAT 
GATGACATTTTGAAAAAAATTTTAGATATTAATTCACAAGTGCAACAGGCACTTCTTAGTGA 
CACCAGGAACCCAGCATATAGAGAAGAT ATTG AAGC CT CT AAAG AT CACC TAAAACGAAGCC 
TTGCTCTAGCAG CAGCAGCAGAACAT AAATTAAAAACAATGTATAAGT CC CAGTTATTGC CA 
GTAGGACG AAC AAGTAAT AAAATTGATG AC AT CGAAACTGTTATTAACATG CTGTGTAATTC 
TAGAT CTAAACT CTATGAATATTTAGAT AT TAAATGTGTTCCAC CAGAGATGAG AGAAAAAG 
CTGCTACAGTATTCAATACATTAAAAAATATGTGTAGATCAAGGAGAGTCACAGCCTTATTA 
AAAGTTTATTAAACAATAATATAAAAATTTTAAACCTACTTGATATTCCATAACAAAGCTGA 
TTTAAG CAAACTGC ATTTTTTC AC AGG AGAAATAATCATATTCGTAATTT CAAAAGTTGTAT 
AAAAATATTTTCTATTGTAGTTCAAATGTGCCAACATCTTTATGTGTCATGTGTTATGAACA 
ATTTTCATATGCACTAAAAACCTAATTTAAAATAAAATTTTGGTTCAGGAAAAAA 
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>< /usr /seqdb2 /as fc /DNA/Dnaseqs . min/ss . DNA6 8 8 64 
xsubunit 1 of 1, 350 aa, 1 stop 
><MW: 39003, pi: 5.59, NX(S/T): 1 

MKPLVLLVALLLWPSSVPAYPSITVTPDEEQNLNHYIQVLENLVRSVPSGEPGREKKSNSPK 
HVYSIASKGSKFKELVTHGDASTENDVLTNPISEETTTFPTGGFTPEIGKKKHTESTPFWSI 
KPNNVSIVLHAEEPYIENEEPEPEPEPAAKQTEAPRMLPWTESSTSPYVTSYKSPVTTLDK 
STGIEISTESEDVPQLSGETAIEKPEEFGKHPESWNNDDILKKILDINSQVQQALLSDTSNP 
AYREDIEASKDHLKRSLALAAAAEHKLKTMYKSQLLPVGRTSNKIDDIETVINMLCNSRSKL 
YEYLDIKCVPPEMREKAATVFNTLKNMCRSRRVTALLKVY 

Signal peptide: 

amino acids 1-19 
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FIGURE 147 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTG 
ACCTCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACA 
GGAATATCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACA 
GTGGCAAGTCACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCT 
CCTGCTCCCTCTTTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAG 
TTCCATGCTGTGGTCCACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACA 
GTATCGAGGGAGAACTGAGTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGC 
TAAAAAACATCACTCCCTCGGACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTAC 
GATGAGGAGGCCACCTGGGAGCTGCGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCAT 
CGTGGGATATGTTGACGGAGGTATCCAGTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGC 
CC AC AG C CAAGTGGAAAGGT CC AC AAGG AC AGGATTTGTCTTCAGACTCCAGAGCAAATGCA 
GATGGGT AC AGC CTGT ATGATGTGGAGATCT CCATT ATAGTCCAGGAAAATG CTGGGAG CAT 
ATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGAGGTGGAATCCAAGGTATTGATAGGAG 
AGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTATTTTACTCGGGTTACTCTGTGGT 
GCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTCAAATCCAAAGGGAAAATCCA 
GG CGGAACTGGACTGGAGAAGAAAGCACGGAC AGGCAG AATTGAGAGACG CC CGGAAAC ACG 
CAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAA 
ACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGAA 
GAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGACGTGGGAC 
AAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACAATGTG 
ACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTCAC 
ATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCC 
TGGACTATGAGGGTGGGACCATCT CCTT CTTCAATACAAATGACCAGT CC CTTATTTATACC 
CTGCTGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGA 
GGAAAAGGGGACTC CCATATTC AT ATGT C CAGTGTC CTGGGG ATGA GACAGAGAAGACCCTG 
CTTAAAGGGCCCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGC 
CCCAGCTTCCTCTCCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGC 
TGAGGTTCTTCTGCCCTGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGAT 
TGGCCTGACCCTGTGGGAGTCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCA 
CATTAGGTTTAGTTTGTGAAAACTCCATCCAGCTAAGCGATCTTGAAC^AGTCACAACCTCC 
CAGG CT CCT CATTTGCTAGT CACGGACAGTGATTCCTGCCTCACAGGTGAAGATTAAAGAG A 
CAACGAATGTGAAT CATG CTTGCAGGTTTGAGGG CACAGTGTTTGCTAATGATGTGTTTTTA 
TATTATACATTTT C CCAC CATAAACTCTGTTTG CTTATTC CACATTAATTTACTTTTCTCTA 
TACCAAATCACCCATGGAATAGTTATTGAACACCTGCTTTGTGAGGCTCAAAGAATAAAGAG 
GAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCATTACCTGATACCAAAACCAGGCAAAG 
AAAACAGAAG AAGAGGAAGGAAAACTAC AGGT CC AT AT CC CT CATTAACACAGACACAAAAA 
TTCTAAATAAAATTTTAACAAATTAAACTAAACAATATATTTAAAGATGATATATAACTACT 
CAGTGTGGTTTGTCCCACAAATGCAGAGTTGGTTTAATATTTAAATATCAACCAGTGTAATT 
CAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA68866 
xsubunit 1 of 1, 466 aa, 1 stop 
><MW: 52279, pi: 6.16, NX(S/T): 2 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHA 
WHLYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEE 
ATWELRVAALGS LPL I S I VGYVDGG I QLLCLS SGWF PQPTAKWKGPQGQDLS SDSRANADGY 
SLYDVE I S 1 1 VQENAGS I LCS I HLAEQSHEVESKVL I GETFFQPS PWRLAS I LLGLLCGALC 
GVVMGMIIVFFKSKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVT 
HRKAPQEVPHSEKRFTRKSVVASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLS 
PNNGYWVLRLTTEHLYFTFNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQ5LIYTLLT 
CQFEGLLRP Y IQHAMYDEEKGTP I F I CPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains : 

amino acids 131-150, 235-259 
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FIGURE 149 



CCTTCACAGGACTCTTCATTGCTGGTTGGCAATGATGTATCGGCCAGATGTGGTGAGGGCTA 
GGAAAAGAGTTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTC 
CTGGCAGTGTGCATTGGACTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAA 
TTACTATAGCACATTGTCATTTACAACTGACAAACTATATGCTGAGTTTGGCAGAGAGGCTT 
CTAACAATTTTACAGAAATGAGCCAGAGACTTGAATCAATGGTGAAAAATGCATTTTATAAA 
TCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTCAGTCAACAGAAGCATGG 
AGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGAAACTGTAG 
ATAAAATTGTTCAACTTG TTTT AC ATGAAAAG CTG C AAGATG CTGT AGGACCCC CTAAAGT A 
G ATC CT CACT CAGTTAAAATT AAAAAAAT C AACAAGACAG AAACAGACAGCTATCTAAACCA 
TTGCTGCGGAACACGAAG7VAGTAAAAGTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAG 
AAGTAGAAGAGGGTGAATGGCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGT 
GGAGCAACCTTAATTAATGCCACATGGCTTGTGAGTGCTGCTCACTGTTTTACAACATATAA 
GAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACAATAAAACCTTCGAAAATGAAACGGG 
GT CTCCGGAGAATAATTG T C CATGAAAAATACAAAC ACC CATCACATGACTATGAT ATTT CT 
CTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTiTCCCTGA 
TGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAA 
ATGATGGTTACAGTCAAAAT CATCTTCGACAAGC AC AGGTGACT CT CATAGACGCTACAACT 
TGGAATGAACCTCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTT 
AGAAGGAAAAAC AGATG CATGCCAGGGTGACT CTGGAGGACC ACTGGTTAGTTCAGATGCTA 
GAGAT AT CTGGT AC CTTG CTGGAATAGTGAGCTGGGGAGATG AATGTGCGAAAC CC AACAAG 
CCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTGGATTACTTCAAAAACTGGTATCTA 
AGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTGGGTGTGGAGGCCATT 
TTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCAATAAAC 
TGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTG 
CCAGATCAACT CTGTCAT CTGTGAGC AATAGT TG AAAC TTT ATGTACATAGAGAAATAGATA 
ATACAATATTACATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGAC 
TTGTTGACATAAATTTGTAATGCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTC 
AGCTCCTCTCATTTCAGCAAATATCCATTTTCAAGGTGCAGAACAAGGAGTGAAAGAAAATA 
TAAG AAGAAAAAAATC CC CT AC ATTTTATTGG CACAGAAAAGTATTAGGTGTTTTTCTTAGT 
GGAATATTAGA^lATGATCATATTCATTATGAAAGGTCAAGCAAAGACAGCAGAATACCAATC 
ACTT CATC ATTT AGGAAGTATGGGAACTAAGTTAAGGAAGTC CAGAAAGAAGCCAAGATATA 
TCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTG 
AC CTAT AATAAT TATACAAACTTC ATGCAATGTACTTGTTCT AAGCAAATTAAAGCAAATAT 
TTATTTAACATTGTTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA68871 
xsubunit 1 of 1, 423 aa, 1 stop 
><MW: 47696, pi: 8.96, NX(S/T): 3 

MMYRPDWRARKRVCWE P WV I GLV IFISLI VL AVC I GLTVHYVR YNQKKTYNYYSTLS FTTD 
KLYAEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVIiAHMLLICR 
FHSTEDPETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTL 
GQSLRIVGGTEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFG 
VTIKPSKMKRGLRRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDV 
MFVTGFGALKNDGYSQNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGD 
SGGPLVSSDARD I WYLAG I VSWGDECAKPNKPGVYTRVTALRDWI TS KTGI 

Transmembrane domain : 

amino acids 21-40 (type II) 



m 
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FIGURE 151 

GTCGAAGGTTATAAAAGCTTCCAGCCAAACGGCATTGAAGTTGAAGATACAACCTGACAGCA 

CAGCCTGAGATCTTGGGGATCCCTCAGCCTAACACCCACAGACGTCAGCTGGTGGATTCCCG 

CTGCATCAAGGCCTACCCACTGTCTCCATGCTGGGCTCTCCCTGCCTTCTGTGGCTCCTGGC 

CGTGACCTTCTTGGTTCCCAGAGCTCAGCCCTTGGCCCCTCAAGACTTTGAAGAAGAGGAGG 

CAGATGAGACTGAGACGGCGTGGCCGCCTTTGCCGGCTGTCCCCTGCGACTACGACCACTGC 

CGACACCTGCAGGTGCCCTGCAAGGAGCTACAGAGGGTCGGGCCGGCGGCCTGCCTGTGCCC 

AGGACTCTCCAGCCCCGCCCAGCCGCCCGACCCGCCGCGCATGGGAGAAGTGCGCATTGCGG 

CCGAAGAGGGCCGCGCAGTGGTCCACTGGTGTGCCCCCTTCTCCCCGGTCCTCCACTACTGG 

CTGCTGCTTTGGGACGGCAGCGAGGCTGCGCAGAAGGGGCCCCCGCTGAACGCTACGGTCCG 

CAGAGCCGAACTGAAGGGGCTGAAGCCAGGGGGCATTTATGTCGTTTGCGTAGTGGCCGCTA 

ACGAGGCCGGGGCAAGCCGCGTGCCCCAGGCTGGAGGAGAGGGCCTCGAGGGGGCCGACATC 

CCTGCCTTCGGGCCTTGCAGCCGCCTTGCGGTGCCGCCCAACCCCCGCACTCTGGTCCACGC 

GGCCGTCGGGGTGGGCACGGCCCTGGCCCTGCTAAGCTGTGCCGCCCTGGTGTGGCACTTCT 

GCCTGCGCGATCGCTGGGGCTGCCCGCGCCGAGCCGCCGCCCGAGCCGCAGGGGCGCTCTGA 

AAGGGGCCTGGGGGCATCTCGGGCACAGACAGCCCCACCTGGGGCGCTCAGCCTGGCCCCCG 

GGAAAGAGGAAAACCCGCTGCCTCCAGGGAGGGCTGGACGGCGAGCTGGGAGCCAGCCCCAG 

GCTCCAGGGCCACGGCGGAGTCATGGTTCTCAGGACTGAGCGCTTGTTTAGGTCCGGTACTT 

GGCGCTTTGTTTCCTGGCTGAGGTCTGGGAAGGAATAGAAAGGGGCCCCCAATTTTTTTTTA 

AGCGGCCAGATAATAAATAATGTAACCTTTGCGGTTAAAAAAAAAAAAAAAAAA 
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></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA68874 
xsubunit 1 of 1, 23 8 aa, 1 stop 
><MW: 25262, pi: 6.44, NX(S/T): 1 

MLGSPCLLWLIJVVTFLVPRAQPLAPQDFEEEEADETETAWPPLPAVPCDYDHCRHLQVPCKE 
LQRVGPAACLCPGLSSPAQPPDPPRMGEVRIAAEEGRAVVHWCAPF 

AQKGPPLNATVRRAELKGLKPGGIYWCWAANEAGASRVPQAGGEGLEGADIPAFGPCSRL 
AVPPNPRTLVHAAVGVGTALALLSCAALVWHFCLRDRWGCPRRAAARAAGAL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 194-220 

N-glycosylation site. 

amino acids 132-135 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCC 
CTGCCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGG 
CCCAGCGCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGC 
TTCTCCCTTACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCT 
CTACGCCCTCAATCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGA 
TGAGGGACTACCTAAATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTC 
ATTTTGACTTACTTTCCTGTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCAT 
TGTGGGGATGTTAGGATATTGTGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACT 
TTGGAAGTTTGCTTGTCATTTTCTGTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAG 
GAACTTATGGTTCCAGTACAATGGTCAGATATGGTCACTTTGAAAGCCAGGATGACAAATTA 

GCTGTGGAGTAGTATATTTCACTGACTGGTTGGAAATGACAGAGATGGACTGGCCCCCAGAT 

TC CTGCTGTGTTAGAG AATT C C CAGG ATGTT CCAAACAGGCC CAC C AGGAAGAT CT CAGTG A 

CCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTTTTTGAGAGGAACCAAACAACTGC 

AGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAATCCTGGCCATGA*TCTCACC 

ATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGGACAGACCAAATGATGTC 

CTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACTGTTGAAACCAAGCC 

TGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACTTTGAGATGGAG 

GAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACTTGTTTTATTGGACTTGTGAATT 

TTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAATAACA 

CCT AAG CATATACT ATTCTATGCTTTAAAATGAGGATGGAAAAGTTTC ATGT CATAAGTCAC 

CACCTGGACAATAATTGATGCC CT TAAAATG CTG AAGACAG ATG TC ATAC CCACTGTGTAGC 

CTGTGTATGACTTTTACTGAAC AC AG TT ATGTTTTGAGG C AG CATGGTTTGATTAG CATTTC 

CGCATCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGTTGATTTACTT 

CTACCAACTAGTATATAAAGTACTAATTAAATGCTAACATAGGAAGTTAGAAAATACTAATA 

ACTTTTATTACT CAGCGATCTATT CTTCTGATGCTAAATAAATTATATATCAGAAAACTTTC 

AATATTGGTGACTACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAA 

GAGCAAGCTAACACATTGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAA 

TC TGT ATAATTC AGTCGATTT CAGTTCTGATAATGTTAAGAATAAC CATTATGAAAAGGAAA 

ATTTGTCCTGTATAGCATCATTATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGT 

CCTGGG CTTATATT ACACAT AT AACTGTTATTTAAATACTT AACCACTAATT TTGAAAATTA 

CCAGTGTGATACATAGGAATCATTATTCAGAATGTAGTCTGGTCTTTAGGAAGTATTAATAA 

GAAAATTTGCACATAACTTAGTTGATTCAGAAAGGACTTGTATGCTGTTTTTCTCCCAAATG 

AAGACTCTTTTTGACACTAAACACTTTTTAAAAAGCTTATCTTTGCCTTCTCCAAACAAGAA 

GC AATAGT CT CCAAGTCAATATAAATTCT ACAGAAAATAG TGTT CTTTTTCTCCAGAAAAAT 

GCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATTCTTTGTTTTATTTCACTGATTA 

ATATACTGTGGCAAATTACACAGATTATTAAATTTTTTTACAAGAGTATAGTATATTTATTT 

GAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTATTTCTCAGAATATGGAA 

AG AAAATTAAAATGTGTCAAT AAATATTTT CTAGAGAGTAA 
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FIGURE 154 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA68880 
xsubunit 1 of 1, 305 aa, 1 stop 
><MW: 35383, pi: 5.99, NX (S/T) : 0 

MAREDSVKCLRCLLYALNLLFWLMS I SVLAVS AWMRDYLNNVLTLTAETRVEEAVILTYFPV 
VHPVMI AVCCFLI I VGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQ 
WSDMVTLKARMTNYGLPRYRWLTHAWNFFQRE 

PGCSKQAHQEDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWAL 
YYDRREPGTDQMMSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-3 3 

Transmembrane domains : 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 155 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCC 
TGCACTCGGGCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGA 
CCTGTGTGGGGAGGCCCTCCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAG 
AC CGGGAGG ATCACAG AG C CAGCATGTT AC AGGAT C CTGACAGTGAT C AACCTCTGAACAG C 
CTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCCATGGAGACCTTCAGAAAGGTGGGGAT 
CCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGTGGTTGTCCTCATCAAGG 
TGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGAGGAAGCAG 
CTGTGTGACGGAGAGCTGGACTGT CC CTTGGGGG AGGACGAGGAGC ACTGTGTCAAGAGCTT 
CCCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGG 
ACTCGGCCACAGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAG 
ACAGCCTGTAGGCAGATGGGCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGA 
TGTTGTTGAAATCACAGAAAACAGCCAGGAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTC 
TCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTGTGGGAAGAGCCTGAAGACCCCCCGT 
GTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGGTCAGCATCCAGTACGA 
CAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCAGCCCACT 
GCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGC 
AGCTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAA 
AGACAATGACATCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGC 
CCATCTGTCTGCCCTTCTTTGATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGA 
TGGGGCTTTACGAAGCAGAATGGAGGGAAGATGT CTGACATACTGCTG CAGG CGTCAGTCCA 
GGT CATTGACAGCAC ACGGTGC AATG CAGACG ATG CGT AC CAGGGGGAAGTCAC CG AGAAGA 
TGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGTGACAGTGGTGGGCCC 
CTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGGCTGCGG 
GGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATG 
TCTGGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTG 
CCCTG C CC AC CTGGGGAT CC CC CAAAGTCAGACACAG AGC AAGAGTCCC CTTGGGTACACCC 
CTCTGCCCACAGCCTCAGCATTTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCC 
TCGCAGCCCAGAGGCGCCCAGAGGAAGTCAGCAGCCCTAGCTCGGCCACACTTGGTGCTCCC 
AGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCTCAGGGGTATTGCTAAGCCAAGAA 
GGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCCCAGATCACTGTGGG 
CTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAAGCCTA 
CTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTT 
ACCTACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCT 
CTGGCAAAAAAAAAAAA 




WO 00/12708 



PCT/US99/20111 
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></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA68885 
xsubunit 1 of 1, 432 aa, 1 stop 
><MW: 47644, pi: 5.18, NX(S/T): 2 

MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPI I IALLSLASI I IVWLIKVILDKYYF 
LCGQPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWF 
SACFDNFTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSL 
HCLACGKS LKTPRWGGEEAS VDS WPWQVS I QYDKQHVCGGS I LDPHWVLTAAHCFRKHTDV 
FNWKVRAGSDKLGSFPSLAVAKII IIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFD 
EELTPATPLWI IGWGFTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGI PE 
GGVDTCQGDSGGPLMYQSDQWHVVGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 157 

GGGCTGAGGCACTGAGAGACCGGAAAGCCTGGCATTCCAGAGGGAGGGAAACGCAGCGGCATCCCCAGGCTCCAG 

AGCTCCCTGGTGACAGTCTGTGGCTGAGCATGGCCCTCCCAGCCCTGGGCCTGGACCCCTGGAGCCTCCTGGGCC 
TTTTCCTCTTCCAACTGCTTC^GCTGCTGCTGC^ 

GGGTCAGATACTATGCAGGGGATGAACGTAGGGCACTTAGCTTCTTCCACCAGAAGGGCCTCCAGGATTTTGACA 
CTCTGC^CCTGAGTGGTGATGGAAATACTCTCTACGTGGGGGCTCGAGAAGCCATTCTGGCCTTGGATATCCAGG 
ATCCAGGGGTCCCCAGGCTAAAGAACATGATACCGTGGCCAGCCAGTGACAGAAAAAAGAGTGAATGTGCCTTTA 
AGAAGAAGAGCAATGAGACACAGTGTTTCAACTTCATCCGTGTCCTGGTTTCTTACAATGTCACCCATCTCTACA 
CCTGCGGC^CCTTCGCCTTCAGCCCTGCTTGTACCTTC^TTGAACTTCAAGATTCCTACCTGTTGCCCATCTCOT 
AGGACAAGGTCATGGAGGGAAAAGGCCAAAGCCCCTTTGACCCCGCTCACAAGCATACGGCTGTCTTGGTGGATG 
GGATGCTCTATTCTGGTACTATGAACAACTTCCTGGGCAGTGAGCCCATCCTGATGCGCACACTGGGATCCCAGC 
CTGTCCTCAAGACCGACAACTTCCTCCGCTGGCTGCATCATGACGCCTCCTTTGTGGCAGCCATCCCTTCGACCC 
AGGTCGTCTACTTCTTCTTCGAGGAGACAGCCAGCGAGTTTGACTTCTTTGAGAGGCTCCACACATCGCGGGTGG 
CTAGAGTCTGCAAGAATGACGTGGGCGGCGAAAAGCTGCTGCAGAAGAAGTGGACCACCTTCCTGAAGGCCCAGC 
TGCTCTGCACCCAGCCGGGGCAGCTGCCCTTCAACGTCATCCGCCACGCGGTCCTGCTCCCCGCCGATTCTCCCA 
CAGCTCCCCACATCTACGCAGTCTTCACCTCCCAGTGGCAGGTTGGCGGGACCAGGAGCTCTGCGGTTTGTGCCT 
TCTCTCTCTTGGACATTGAACGTGTCTTTAAGGGGAAATACAAAGAGTTGAACAAAGAAACTTCACGCTGGACTA 
CTTATAGGGGCCCTGAGACCAACCCCCGGCCAGGCAGTTGCTCAGTGGGCCCCTCCTCTGATAAGGCCCTGACCT 
TCATGAAGGACCATTTCCTGATGGATGAGCAAGTGGTGGGGACGCCCCTGCTGGTGAAATCTGGCGTGGAGTATA 
CACGGCTTGCAGTGGAGACAGCCCAGGGCCTTGATGGGCACAGCCATCTTGTCATGTACCTGGGAACCACCACAG 
GGTCGCTCCACAAGGCTGTGGTAAGTGGGGACAGCAGTGCTCATCTGGTGGAAGAGATTCAGCTGTTCCCTGACC 
CTGAACCTGTTCGCAACCTGCAGCTGGCCCCCACCCAGGGTGCAGTGTTTGTAGGCTTCTCAGGAGGTGTCTGGA 
GGGTGCCCCGAGCCAACTGTAGTGTCTATGAGAGCTGTGTGGACTGTGTCCTTGCCCGGGACCCCCACTGTGCCT 
GGGACCCTGAGTCCCGAACCTGTTGCCTCCTGTCTGCCCCCAACCTGAACTCCTGGAAGCAGGACATGGAGCGGG 
GGAACCCAGAGTGGGCATGTGCCAGTGGCCCCATGAGCAGGAGCCTTCGGCCTCAGAGCCGCCCGCAAATCATTA 
AAGAAGTCCTGGCTGTCCCCAACTCCATCCTGGAGCTCCCCTGCCCCCACCTGTCAGCCTTGGCCTCTTATTATT 
GGAGTCATGGCCCAGCAGCAGTCCCAGAAGCCTCTTCCACTGTCTACAATGGCTCCCTCTTGCTGATAGTGCAGG 
ATGGAGTTGGGGGTCTCTACCAGTGCTGGGCAACTGAGAATGGCTTTTCATACCCTGTGATCTCCTACTGGGTGG 
ACAGCCAGGACCAGACCCTGGCCCTGGATCCTGAACTGGCAGGCATCCCCCGGGAGCATGTGAAGGTCCCGTTGA 
CCAGGGTCAGTGGTGGGGCCGCCCTGGCTGCCCAGCAGTCCTACTGGCCCCACTTTGTCACTGTCACTGTCCTCT 
TTGCCTTAGTGCTTTCAGGAGCCCTCATCATCCTCGTGGCC^CCCCATTGAGAGCACTCCGGGCTCGGGGCAAGG 
TCCAGGGCTGTGAGACCCTGCGCCCTGGGGAGAAGGCCCTO^ 

AATGCAGGACCTCTGCCAGTGATGTGGACGCTGACAACAACTGCCTAGGCACTGAGGTAGCTTAA^ 

CAGGCCGGGG CTGCGGTGCAGGCACCTGGCCATGCTGGCTGGGCGGCC CAAGCACAGCCCTGACTAGGATGACAG 

CAGCACAAAAGACCACCTTTCTCCCCTGAGAGGAGCTTCTGCTACTCTGCATCACTGATGACACTCA 

ATGCACAGCAGTCTGCCTCCCCTATGGGACTCCCTTCTACCAAGCACATGAGCTCTCTAACAGGGTGGGGGCTAC 

CCCCAGACCTGCTCCTACACTGATATTGAAGAACCTGGAGAGGATCCTTCAGTTCTGGCCATTCCAGGGACCCTC 

CAGAAACACAGTGTTTCAAGAGACCCTAAAAAACCTGCCTGTCCCAGGACCCTATGGTAATGAACACCAAACATC 

TAAACAATCATATGCTAACATGCCACTCCTGGAAACTCC^ 

TCTCCCAGGGTCATGCAGGGATCTGCTCCCTCCTGCTTCCCTTACCAGTCGTGCACCGCTGACTCCCAGGAAGTC 
TTTCCTGAAGTCTGACCACCTTTCTTCTTGCTTCAGTTGGGGCAGACTCTGATCCCTTCTGCCCTGGCAGAATGG 
GAGGGGTAATCTGAGCCTTCTTCACTCCTTTACCCTAGCTGACCCCTTCACCTCTCCCCCTCCCTTTTCCTTTGT 
TTTGGGATTCAGAAAACTGCTTGTCAGAGACTGTTTATTTTTTATTAAAAATATAAGGCTTAAAAAAA 
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FIGURE 158 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA71166 
xsubunit 1 of 1, 761 aa, 1 stop 
><MW: 83574, pi: 6.78, NX(S/T): 4 

MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQ 
DFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQC 
FNFI RVLVS YNVTHLYTCGTFAFS PACT F I ELQDS YLLP I SEDKVMEGKGQS PFDPAHKHTA 

VLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPSTQVVYFFFE 

ETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLPFNVIRHAVLL 

PADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPET 

NPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAVETAQGLPGHSHLVM 

YLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRAN 

CSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPEWACASGPMSRSLRP 

QSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLIVQDGVGG 

LYQCWATENGFSYPVISYWDSQDQTLALDPELAGIPREHVKVPLTRVSGGAALAAQQSYWP 

HFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCETLRPGEKAPLSREQHLQSPKECRT 
SASDVDADNNCLGTEVA 

Signal peptide: 

amino acids 1-3 0 

Transmembrane domains : 

amino acids 136-156, 222-247, 474-490, 685-704 
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FIGURE 159 



AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAG 

TGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTC 

TCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGA 

CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAGAG 

GTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCTTGCA 

CCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAACTTTAGG 

TGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCAGTGCAGTC 

ATTTTTTAAATAGAAAGGATAT CATG GATT C CTTAAAGAATG AG AACTT CGACATGGTGATA 

GTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGC 

CATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTC 

CAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTG 

ATGTTCTTTAGTTT CTGCAGGAGGCAACAG CACATGCAGTCTACATTTGACAACAC CATCAA 

GGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGT 

GGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTAT 

GTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAAClTCATTGC 

CAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGA 

ATCC GG AAAT CTT CAAGGAGATGAACAATGCCTTTG CT CAC CTACC CC AAGGGGTGAT ATGG 

AAGTGTCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGA 

CTGGCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCG 

GGCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTT 

GGAGACCAGCCTGAAAACATGGTC CGAGTAGAAGCCAAAAAGTTTGGTGTTT CTATTCAGTT 

AAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGAT 

ACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAG 

CGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTA 

TGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGC 

TCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGT 

GGGGCCAGAAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGG 

GCGATGTCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTC 

TAGTTATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGC 

TAATTTTG CTAC AAATTCAT C CTTACTAGCTCCTGC CTG CTAGCAGAAATCTTT CCAGTC CT 

CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACTTG 

GACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTCCGAA 

TCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCCTGCCCC 

TACTAT CTAT CATGGAATAACATCC AAGAAAGACACCTTG CATATT CTTT CAGTTT CTGTTT 

TGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAGAGTTCAGGGC 

CGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCACAGTGAGCTCCT 

TCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAAAATAAAAGTTTACA 

GCGTTATCTCTCCCCAACCTCACTAA 
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FIGURE 160 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA71169 
xsubunit 1 of 1, 523 aa, 1 stop 
><MW: 59581, pi: 8.68, NX(S/T): 1 

MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGP 
FMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQCSHF 
LNRKD I MDSLKNENFDMV I VETFD YCP FL I AEKLGKPF VAI LSTS FGSLE FGLP I PLS YVP V 
FRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWF 
INSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSKVNTCQNP 
EI FKEMNNAFAHLPQGVI WKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPS IRLFVTHGGQ 
NSIMEAIQHGVPrWGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQJMEDKRYK 
S AAVAAS V I LRSH PLS PTQRLVGW I DHVLQTGGATHLKP YVFQQ PWHEQ YLFDVF VFLLGLT 
LGTLWLCGKLLGMAVWWLRGARKVKET 

Signal peptide: 

amino acids 1-19 

Transmembrane domain : 

amino acids 483-504 
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FIGURE 161 



GGGCTGTTGATTTGTGGGGGATTTTGAAGAGAGGAGGAATAGGAGGAAGGGGTTGAGGGGCT 

GCCTCTGGCATATGCACACACTCACACATT CTGTCACACC CGTCACACAC ACATAC CATGTT 

CTCCATCCCCCCAGGTCCAGCCCTCAGTGCTGTCCCATCCAGCAGGGCTACCCTGAAGCTCT 

GGCTGCAGCCCTCCCGTCCAGTGGGCAGGCGGCTTCATCCCTCCTTTCTCTCCCAAAGCCCA 

ACTGCTGTCACTGCATGCTCTGCCAAGGAGGAGGGAACTGCAGTGACAGCAGGAGTAAGAGT 

GGG AGG CAGGAC AG AG CTGGGA CACAGGTATGGAGAGGGGGTT C AG CGAGCCT AGAGAGGGC 

AGACTAT C AGGGTG CCGG CG GTG AGAATCC AGGG AG AGGAGCGGAAACAGAAGAGGGG GAGA 

AGACCGGGGCACTTGTGGGTTGCAGAGCCCCTCAGCCATGTTGGGAGCCAAGCCACACTGGC 

TACCAGGTCCCCTACACAGTCCCGGGCTGCCCTTGGTTCTGGTGCTTCTGGCCCTGGGGGCC 

GGGTGGGCCCAGGAGGGGTCAGAGCCCGTCCTGCTGGAGGGGGAGTGCCTGGTGGTCTGTGA 

GCCTGGCCGAGCTGCTGCAGGGGGGCCCGGGGGAGCAGCCCTGGGAGAGGCACCCCCTGGGC 

GAGTGGCATTTGCTGCGGTCCGAAGCCACCACCATGAGCCAGCAGGGGAAACCGGCAATGGC 

ACCAGTGGGGCCATCTACTTCGACCAGGTCCTGGTGAACGAGGGCGGTGGCTTTGACCGGGC 

CTCTGGCTCCTTCGTAGCCCCTGTCCGGGGTGTCTACAGCTTCCGGTTCCATGTGGTGAAGG 

TGTACAACCGCCAAACTGTCCAGGTGAGCCTGATGCTGAACACGTGGCCTGTCATCTCAGCC 

TTTGCCAATGATCCTGACGTGACCCGGGAGGCAGCCACCAGCTCTGTGCTACTGCCCTTGGA 

CCCTGGGGACCGAGTGTCTCTGCGCCTGCGTCGGGGGAATCTACTGGGTGGTTGGAAATACT 

CAAGTTTCTCTGGCTTCCTCATCTTCCCTCTCTGAGGACCCAAGTCTTTCAAGCACAAGAAT 

CCAGCCCCTGACAACTTTCTTCTGCCCTCTCTTGCCCCAGAAACAGCAGAGGCAGGAGAGAG* 

ACTCCCTCTGGCTCCTATCCCACCTCTTTGCATGGGACCCTGTGCCAAACACCCAAGTTTAA 

GAGAAGAGTAGAGCTGTGGCATCTCCAGACCAGGCCTTTCCACCCACCCACCCCCAGTTACC 

CTCCCAGCCACCTGCTGCATCTGTTCCTGCCTGCAGCCCTAGGATCAGGGCAAGGTTTGGCA 

AGAAGGAAGATCTGCACTACTTTGCGGCCTCTGCTCCTCCGGTTCCCCCACCCCAGCTTCCT 

GCTCAATGCTGATCAGGGACAGGTGGCGCAGGTGAGCCTGACAGGCCCCCACAGGAGCCCAG 

ATGGACAAGCCTCAGCGTAC CCTGCAGGCTTCTTCCTGTGAGGAAAGC CAGCATCACGGATC 

TCAGCCAGCACCGTCAGAAGCTGAGCCAGCACCGTATGGGCTAGGGTGGGAGGCTCAGCCAC 

AGGC AGAAGGGTGGGAAGGG CC TGGAGT CTGTGG CTGGTGAGGAAGGAAGGAGGGTGTATTG 

TCTAG ACTGAAC ATGG TACAC ATTCTGC ATGTAT AG CAGAGCAG CCAGCAGGTAG CAATC CT 

GGCTGTCCTTCTATGCTGGATCCCAGATGGACTCTGGCCCTTACCTCCCCACCTGAGATTAG 

GGTGAGTGTGTTTGCT CTGGCTGAGAGCAGAGCTGAGAGCAGGTATACAGAG CTGGAAGTGG 

ACCATGGAAAACATCGATAACCATGCATCCTCTTGCTTGGCCACCTCCTGAAACTGCTCCAC 

CTTTGAAGTTTGAACTTTAGTCCCTCCACACTCTGACTGCTGCCTCCTTCCTCCCAGCTCTC 

TCACTGAGTTATCTTCACTGTACCTGTTCCAGCATATCCCCACTATCTCTCTTTCTCCTGAT 

CTGTGCTGTCTTATTCTCCTCCTTAGGCTTCCTATTACCTGGGATTCCATGATTCATTCCTT 

CAGACCCTCTCCTGCCAGTATGCTAAACCCTCCCTCTCTCTTTCTTATCCCGCTGTCCCATT 

GG CCCAGCCTGGATGAAT CTATCAATAAAA CAACTAGAG AATGGTGGT CAGTG AGACACTAT 

AGAATTACTAAGGAGAAGATG CC T CTGGAGTTTGGATCGGGTGTTACAGGTACAAGTAGGT A 

TGTTGCAGAGGAAAATAAATATCAAACTGTATACTAAAATTAAAAA 
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FIGURE 162 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA71180 
xsubunit 1 of 1, 205 aa, 1 stop 
><MW: 21521, pi: 7.07, NX(S/T): 1 

MLGAKPHWLPGPLHSPGLPLVLVLLALGAGWAQEGSEPVLLEGECLWCEPGRAAAGGPGGA 
ALGE AP PGRVAFAAVRSHHHE PAGETGNGTSG AI YFDQ VLVNEGGGFDRASGS FVAPVRGVY 
SFRFHWKVYNRQTVQ VS LMLNTWPV I S AFAND PDVTREAATS S VLL PLDPGDRVSLRLRRG 
NLLGGWKYS S FSGFL I F PL 

Signal peptide: 

amino acids 1-32 
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FIGURE 163 

GCTGTTTCTCTCGCGCCACCACTGGCCGCCGGCCGCAGCTCCAGGTGTCCTAGCCGCCCAGC 

CTCGACGCCGTCCCGGGACCCCTGTGCTCTGCGCGAAGCCCTGGCCCCGGGGGCCGGGGCAT 

GGGCCAGGGGCGCGGGGTGAAGCGGCTTCCCGCGGGGCCGTGACTGGGCGGGCTTCAGCCAT 

GAAGACCCTCATAGCCGCCTACTCCGGGGTCCTGCGCGGCGAGCGTCAGGCCGAGGCTGACC 

GGAGCCAGCGCTCTCACGGAGGACCTGCGCTGTCGCGCGAGGGGTCTGGGAGATGGGGCACT 

GGATCCAGCATCCTCTCCGCCCTCCAGGACCTCTTCTCTGTCACCTGGCTCAATAGGTCCAA 

GGTGGAAAAGCAGCTACAGGTCATCTCAGTGCTCCAGTGGGTCCTGTCCTTCCTTGTACTGG 

GAGTGGCCTGCAGTGCCATCCTCATGTACATATTCTGCACTGATTGCTGGCTCATCGCTGTG 

CTCTACTTCACTTGGCTGGTGTTTGACTGGAACACACCCAAGAAAGGTGGCAGGAGGTCACA 

GTGGGTCCGAAACTGGGCTGTGTGGCGCTACTTTCGAGACTACTTTCCCATCCAGCTGGTGA 

AGACACACAACCTGCTGACCACCAGGAACTATATCTTTGGATACCACCCCCATGGTATCATG 

GGCCTGGGTGCCTTCTGCAACTTCAGCACAGAGGCCACAGAAGTGAGCAAGAAGTTCCCAGG 

CATACGGCCTTACCTGGCTACACTGGCAGGCAACTTCCGAATGCCTGTGTTGAGGGAGTACC 

TGATGTCTGGAGGTATCTGCCCTGTCAGCCGGGACACCATAGACTATTTGCTTTCAAAGAAT 

GGGAGTGGCAATGCTATCATCATCGTGGTCGGGGGTGCGGCTGAGTCTCTGAGCTCCATGCC 

TGGCAAGAATGCAGTCACCCTGCGGAACCGCAAGGGCTTTGTGAAACTGGCCCTGCGTCATG 

GAGCTGACCTGGTTCCCATCTACTCCTTTGGAGAGAATGAAGTGTACAAGCAGGTGATCTTC 

GAGGAGGGCTCCTGGGGCCGATGGGTCCAGAAGAAGTTCCAGAAATACATTGGTTTCGCCCC 

ATGCATCTTCCATGGTCGAGGCCTCTTCTCCTCCGACACCTGGGGGCTGGTGCCCTACTCCA 

AGCCCATCACCACTGTTGTGGGAGAGCCCATCACCATCCCCAAGCTGGAGCACCCAACCCAG 

CAAGACATCGACCTGTACCACACCATGTACATGGAGGCCCTGGTGAAGCTCTTCGACAAGCA 

CAAGACCAAGTTCGGCCTCCCGGAGACTGAGGTCCTGGAGGTGAACIgAGCCAGCCTTCGGG 

GCCAATTCCCTGGAGGAACCAGCTGCAAATCACTTTTTTGCTCTGTAAATTTGGAAGTGTCA 

TGGGTGTCTGTGGGTTATTTAAAAGAAATTATAACAATTTTGCTAAACCAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAA 
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FIGURE 164 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA71184 
xsubunit 1 of 1, 388 aa, 1 stop 
><MW: 43831, pi: 9.64, NX(S/T): 3 

MKTLIAAYSGVLRGERQAEADRSQRSHGGPALSREGSGRWGTGSSILSALQDLFSVTWLNRS 
KVEKQLQVISVLQWVLSFLVLGVACSAILMYIFCTDCWLIAVLYFTWLVFDWNTPKKGGRRS 
QWVRNWAVWRYFRD YF P I QL VKTHNLLTTRN Y I FGYH PHG IMGLGAFCNFSTEATE VS KKF P 
G I RP YLATLAGNFRMP VLRE YLMSGG I CPVSRDT I DYLLSKNGSGNAI 1 1 WGGAAES LS SM 
PGKNAVTLRNRKGF VKLALRHGADLVP I YS FGENEVYKQV I FE EGS WGRWVQKKFQKY I GFA 

PCIFHGRGLFSSDTWGLVPYSKPITTWGEPITIPKLEHPTQQDIDLYHTMYMEALVKLFDK 
HKTKFGLPETEVLEVN 

Important features of the protein: 
Transmembrane domain: 

amino acids 76-97 

N-glycosylation sites. 

amino acids 60-63, 173-176, 228-231 

N-myristoylation sites. 

amino acids 10-15, 41-46, 84-89, 120-125, 169-174, 229-234, 240- 
245, 318-323, 378-383 
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FIGURE 165 



GGGCGGCGGGATGGGGGCCGGGGGCGGCGGGCGCCGCACTCGCTGAGGCCCCGACGCAGGGCCGGGCCGGGCCCA 
GGGCCGAGGAGCGCGGCGGCCAGAGCGGGGCCGCGGAGGCGACGCCGGGGACGCCCGCGCGACGAGCAGGTGGCG 
GCGGCTGCAGGCTTGTCCAGCCGGAAGCCCTGAGGGCAGCTGTTCCCACTGGCTCTGCTGACCTTGTGCCTTGGA 
CGGCTGTCCTCAGCGAGGGGCCGTGCACCCGCTCCTGAGCAGCGCCATGGGCCTGCTGGCCTTCCTGAAGACCCA 
GTTCGTGCTGCACCTGCTGGTCGGCTTTGTCTTCGTGGTGAGTGGTCTGGTCATCAACTTCGTCCAGCTGTGCAC 
GCTGGCGCTCTGGCCGGTCAGCAAGCAGCTCTACCGCCGCCTCAACTGCCGCCTCGCCTACTCACTCTGGAGCCA 
ACTGGTCATGCTGCTGGAGTGGTGGTCCTGCACGGAGTGTACACTGTTCACGGACCAGGCCACGGTAGAGCGCTT 
TGGGAAGGAGCACGCAGTCATCATCCTCAACCACAACTTCGAGATCGACTTCCTCTGTGGGTGGACCATGTGTGA 
GCGCTTCGGAGTGCTGGGGAGCTCCAAGGTCCTCGCTAAGAAGGAGCTGCTCTACGTGCCCCTCATCGGCTGGAC 
GTGGTACTTTCTGGAGATTGTGTTCTGCAAGCGGAAGTGGGAGGAGGACCGGGACACCGTGGTCGAAGGGCTGAG 
GCGCCTGTCGGACTACCCCGAGTACATGTGGTTTCTCCTGTACTGCGAGGGGACGCGCTTCACGGAGACCAAGCA 
CCGCGTTAGCATGGAGGTGGCGGCTGCTAAGGGGCTTCCTGTCCTCAAGTACCACCTGCTGCCGCGGACCAAGGG 
CTTCACCACCGCAGTCAAGTGCCTCCGGGGGACAGTCGCAGCTGTCTATGATGTAACCCTGAACTTCAGAGGAAA 
CAAGAACCCGTCCCTGCTGGGGATCCTCTACGGGAAGAAGTACGAGGCGGACATGTGCGTGAGGAGATTTCCTCT 
GGAAGACATC CCGCTGGATGAAAAGGAAG CAGCTCAGTGGCTTCATAAACTGTACCAGGAGAAGGACG CG CTCCA 
GGAGATATATAATCAGAAGGGCATGTTTCCAGGGGAGCAGTTTAAGCCTGCCCGGAGGCCGTGGACCCTCCTGT^A 
CTTCCTGTCCTGGGCCACCATTCTCCTGTCTCCCCTCTTCAGTTTTGTCTTGGGCGTCTTTGCCACJCGGATCACC 
TCTCCTGATCCTGACTTTCTTGGGGTTTGTGGGAGCAGCTTCCTTTGGAGTTCGCAGACTGATAGGAGAATCGCT 
TGAACCTGGGAGGTGGAGATTGCAGTCAGCTGAGATGGCATCACTGTACTCCAGCCTAGGCAACAGAGCAAGACT 
CAGTCTCAAAAAAAAAAAAAAACAAAAAAACCCCAGAAATTCTGGAGTT 

ATTCACTAGAGGCTGAACAGCAGATTTGAGCAGGCAGAAAAAAATCAGCAAGCTTGAAGATGGTACCTTGAGATT 

TTTCAGGCTAATGAAAAAAGAATGAAGGAAAATTAACAGCCTCAGAGACCCATGGTGCACCGTCACAC^ 

CATATGCATGATGAG AGTCC CAGAAGGAG AGG AGAGAAAGGGTCAGAAAG AATGGC CACAAG CTGATGAAAAACA 

GTAACCTACCCACTCAGGAAGCTCAGTGAACTCCAATGAGGATGAATATCAGAGATCC7VCACCTAGATATTTCAT 

AATCAAAGTGTCAAATGACAAAGAATCTTGAAAGCAGCAAGAGATGAGCAACTTATCTTGTTCAAAGGATCT^ 

ATCAGATTAACAGCTCATTTCTCCTCAGAAATCATGGGAGCCAGGAGATAGTGGGATGAACACTGTTGAAGGCAA 

AACCTTCAACTGTAATTATTGGACTTTTGAGTCTTAGATGGTCCTGACCTCTTTGTCTTCAGGGAC^GTTTTTCA 

ATTTAATCCCTAATAACAATTAGTCAAGCTTCCTTGACCTGTAGGAAGGCCTGTCTTTAGGCCGGGCACAGTGGC 

TTACACCTGTAATCCCAGCACTTTGGGAGGCCCAGACGGGTGGATCATTTGGGGTCAGGCTGATCTCAAACTCCT 

GAGTTCAGGTGATCTGCCCGCCTCAGCCTCCCAAAGTGTTGTGATTGCAGGCGTGAGCCACTGCGCCTGGCCGGA 

ATTTCTTTTTAAGGCTGAATGATGGGGGCCAGK3CACGATGGCTCACGCCTGTGATCCCAAGTAGCTTGGATTGTA 

AACATGCACCACCATGCCTGGCTAATTTTTCTATTTTTAGTAGAGACGTGTTAGCCAGGCTGGTCTCGATCTCCT 

GACCTCAAGTGACCACCTGCCTCAGCCTCCCAAAGTACTGGGATTACAGGCGTGAGCCACTGTGCCTGGCCTTGA 

GCATCTTGTGATGTGCTTATTGGCC^TTTGTATATCTTCTATCTTCTTTGGGGT^AATGTCTGTTCAAGTCCTTTG 

CCTTTTTAAATTTTTATTATTTATTTATTTATTTATTTTGAGACAGGGTCTTGTTCTGTTGCCCAGGCTGGAGTA 

C^GTGGCACAGTCTTGGCTCACTGCAGCCTCGACCTCCTGGGCTGCAGTGATCOTCCCACCTCAGCCT^ 

AGCTGTATTTTTTTGTATTTTGTATTTTGTAGCTGTAGTTTTTGTATTTTTTGTGGAGACAGCATTTCACCATGA 

TGCCCAGGCTGGTCTTGAACTCCTGAGCTCAAGTGATCTGCCTGCTTCAGCCTCCCAAAGTGCTGGGATTACAGA 

CATGAGCCACTGCACCTGGCAAACTCCCAAAATTCAACACACACACACAAAAAACCACCTGATTC7VAAA 

GAGGGGCCGGGTGTGGCCCCAACTACCAGGGAGACTGAAGTGGGAGGATCGCTTGGGCATGAGAAGTCGAGGCTG 

CAGTGAGTCGAGGTTGTGCGACTGCATTCCAGCCTGGACAACAGAGTGAGACCCTGTCTC 
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FIGURE 166 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA71213 
xsubunit 1 of 1, 368 aa, 1 stop 
><MW: 42550, pi: 9.11, NX(S/T): 1 

MGLLAFLKTQFVLHLLVGFVFVVSGLVINFVQLCTLALWPVSKQLYRR 
MLLEWWSCTECTLFTDQAWERFGKEHAVIILNW 

ELLYVPLIGWTWYFLEIVFCKRKWEEDRDTWEGLRRLSDYPEYMWFLLYCEGTRFTETKHR 
VSMEVAAAKGLPVLKYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFRGNKNPSLLGILYGKK 
YEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDALQEIYNQKGMFPGEQFKPARRPWTLLN 
FLSWATILLSPLFSFVLGVFASGSPLLILTFLGFVGAASFGVRRLIGESLEPGRWRLQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

Transmembrane domains: 

amino acids 307-323, 335-352 

Tyrosine kinase phosphorylation sites. 

amino acids 160-168, 161-169 
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FIGURE 167 



GATATTCTTTATTTTTAAGAATCTGAAGTACTAIGCATCACTCCCTCCAATGTCCTGGGGCA 

GCCACCAGGCATATTCATCTTTGTGTGTGTTTTTCTTTTGCTTTAGCACTGGGGCACTTCTT 

GCTTATTTCTTTGGTAGGAAAGGGGCTCAGTTTGTCTTGTGGGGTTGGTGGCAGGCAGGCCG 

GCTTACGCCTGATACGGCCCTGGGTTAGAAGGGAAGGGAAGATAAACTTTTATACAAATGGG 

GATAGCTGGGGTCTGAGACCTGCTTCCTCAGTAAAATTCCTGGGATCTGCCTATACCTTCTT 

TTCTCTAACCTGGCATACCCTGCTTAAAGCCTCTCAGGGCTTCTCTCTGTTCTTAGGATCAA 

AGTATTTAGAGCTACAAGAGCCCTCATGGTCTGGCCCCTGCCCCCCTGGCCAGCTTCATTGT 

ACATGTGGTGTTCTCTTGTCGTTCCTGTAATGTGGTATGCCATGGGGTCTTTGCACAAGCCT 

TTCCTCTTTGGCTGGACACTGTTCCCTGCCCCCCCCATACTCTTCCTACTTAATATGTAGTC 

ATCCTGCAGATTTCAATTCTAACATCATTTTCTCCAGGGATCCTGGCCTGACAGAATCTCAT 

CTTGTTTAATGCTCTCATAAGACCACTTGTTTCCCTTTTGCAGCACTTGCCACTCAGTTGTA 

TCTTTATGTGCGTTTGTGGTTGTATGGGTTGTGTCTGTTCCCCAGAATGCCCAGCTCTGAGC 

TGCGTGAGGGTCAAGGGCATTGCTGTGCCTGCCAGGTATAGTGCCTACATGTGGTGGGTGCT 

CATGTTTTAGAGACTAAATGGAGGAGGAGATGAGGAAAAGATTGAAATCTCTCAGTTCACCA 

GATGGTGTAGGGCCCAGCATTGTAAATTCACACGTTGACTGTGCTTGTGAATTATCTGGGGA 

TGCAGGTCCTGATTCAGTAGGCCCAGGTTGGGCATCTCTAACAAACTCCCACGTGATGCTGA 

TGCTGGTCCTATGAACTATACTAAATAGTAAGAATCTATGGAGCCAGGCTGGGCATGGTGGC 

TCACACCTATGATCCCAGCACTTTGGGAGGCTGAGGCAGGCTGATCACCTGGAGTCAGGATT 

TCAAGACTAGCCTGGCCAACATGGTGGAACCCCATCTGTACTAAAAATACACAAATTAGCTG 

GGCATGGTGGCACATGCCTGTAGTCCCAGCTACTTGGGAGGCTGAAGCAAGAGAATCGCTTG 

AACCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATCAGGCCACTGTATTCCAACCAGGGTGAC 

AGAGTGAGACTCTATGTCCAAAAAAAAAAAAA 
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FIGURE 16S 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71234 
xsubunit 1 of 1, 143 aa, 1 stop 
><MW: 15624, pi: 9.58, NX(S/T): 0 

MHHSLQCPGAATRHIHLCVCFSFALALGHFLLISLVGKGLSLSCGVGGRQAGLRLIRPWVRR 

EGKINFYTNGDSWGLRPASSVKFLGSAYTFFSLTWHTLLKASQGFSLFLGSKYLELQEPSWS 
GPCPPGQLHCTCGVLLSFL 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
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FIGURE 169 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGA 

TTATAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCAC 

AGGAGTTGAACTGCTAGGATTCTGACTAIGCTGTGGTGGCTAGTGCTCCTACTCCTACCTAC 

ATTAAAATCTGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGT 

CACTGTGGCTCTGGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCAC 

ACCGTCCCCTCGAAGCCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTC 

ACCAACTGTCTCACGTCTGGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTA 

GCTGCGGCTTTCAAGGTGGGCCTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTT 

CATAGGCGATGGCTCCCACTGCCCAGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGG 

CCAGGACGGGCCGTGGACACCTGCTCAGAAGCAGTGGGTGAGACATCACGCTGCCCGCCCAT 

CTAACCTTTTCATGTCCTGCACATCACCTGATCCATGGGCTAATCTGAACTCTGTCCCAAGG 

AACCCAGAGCTTGAGTGAGCTGTGGCTCAGACCCAGAAGGGGTCTGCTTAGACCACCTGGTT 

TATGTGACAGGACTTGCATTCTCCTGGAACATGAGGGAACGCCGGAGGAAAGCAAAGTGGCA 

GGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATGGAGGTGTTGGGTTATCACAAGGC 

ATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTGCCGATGGCGCATGACACACT 

CGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGATCCACGTACCAGCTGCTG 

AAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAATCTGCGATCACCAG 

CCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCCCTCCTTCCCTC 

TGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGGGGCTAATG 

GCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCTGTGCG 

AACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGGCT 

CAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGT 

TCAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAA 

TCCAAACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATG 

CCAACATTTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTG 

GCCAACATGGTGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGT 

GTGTGCCTGTATCCCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGT 

GAAGGAGGCTGAGACAGGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGA 
AAAAATAAAAAAAGAATTATGGTTATTTGTAA 
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FIGURE 170 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA71277 
xsubunit 1 of 1, 109 aa, 1 stop 
><MW: 11822, pi: 8,63, NX (S/T) : 0 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLV7PKPDLHSGTRTEVSTHTVPSKPGTA 
SPCWPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE 171 



GCGGGCCCGCGAGTCCGAGACCTGTCCCAGGAGCTCCAGCTCACGTGACCTGTCACTGCCTC 

CCGCCGCCTCCTGCCCGCGCCASGACCCAGCCGGTGCCCCGGCTCTCCGTGCCCGCCGCGCT 

GGCCCTGGGCTCAGCCGCACTGGGCGCCGCCTTCGCCACTGGCCTCTTCCTGGGGAGGCGGT 

GCCCCCCATGGCGAGGCCGGCGAGAGCAGTGCCTGCTTCCCCCCGAGGACAGCCGCCTGTGG 

CAGTATCTTCTGAGCCGCTCCATGCGGGAGCACCCGGCGCTGCGAAGCCTGAGGCTGCTGAC 

CCTGGAGCAGCCGCAGGGGGATTCTATGATGACCTGCGAGCAGGCCCAGCTCTTGGCCAACC 

TGGCGCGGCTCATCCAGGCCAAGAAGGCGCTGGACCTGGGCACCTTCACGGGCTACTCCGCC 

CTGGCCCTGGCCCTGGCGCTGCCCGCGGACGGGCGCGTGGTGACCTGCGAGGTGGACGCGCA 

GCCCCCGGAGCTGGGACGGCCCCTGTGGAGGCAGGCCGAGGCGGAGCACAAGATCGACCTCC 

GGCTGAAGCCCGCCTTGGAGACCCTGGACGAGCTGCTGGCGGCGGGCGAGGCCGGCACCTTC 

GACGTGGCCGTGGTGGATGCGGACAAGGAGAACTGCTCCGCCTACTACGAGCGCTGCCTGCA 

GCTGCTGCGACCCGGAGGCATCCTCGCCGTCCTCAGAGTCCTGTGGCGCGGGAAGGTGCTGC 

AACCTCCGAAAGGGGACGTGGCGGCCGAGTGTGTGCGAAACCTAAACGAACGCATCCGGCGG 

GACGTCAGGGTCTACATCAGCCTCCTGCCCCTGGGCGATGGACTCACCTTGGCCTTCAAGAT 

CTAGGGCTGGCCCCTAGTGAGTGGGCTCGAGGGAGGGTTGCCTGGGAACCCCAGGAATTGAC 

CCTGAGTTTTAAATTCGAAAATAAAGTGGGGCTGGGACACAAAAAAAAAAAAAAAAAAA 
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FIGURE 172 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA71282 
xsubunit 1 of 1, 262 aa, 1 stop 
><MW: 28809, pi: 8 . 80 , NX (S/T) : 1 

MTQPVPRLSVPAALALGSAALGAAFATGLFLGRRCPPWRGRREQCLLPPEDSRLWQYLLSRS 
MREHPALRSLRLLTLEQPQGDSMMTCEQAQLLANLAR^^ 

PADGRVVTCEVDAQPPELGRPLWRQAEAEHKIDLRLKPALETLDELLAAGEAGTFDVAVVDA 
DKENCS AY YERCLQLLRPGG I LAVLRVLWRGKVLQ PPKGD VAAE CVRNLNER I RRDVRVY I S 
LLPLGDGLTLAFKI 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

Transmembrane domains: 

amino acids 8-30, 109-130 

N-glycosylation site. 

amino acids 190-193 

Tyrosine kinase phosphorylation site. 

amino acids 238-246 

N-myristoylation sites. 

amino acids 22-27, 28-33, 110-115, 205-210, 255-260 

Amidation sites. 

amino acids 31-34, 3 9-42 
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FIGURE 173 



CCGCCGCCGCAGCCGCTACCGCCGCTGCAGCCGCTTTCCGCGGCCTGGGCCTCTCGCCGTCA 

GCATSCCACACGCCTTCAAGCCCGGGGACTTGGTGTTCGCTAAGATGAAGGGCTACCCTCAC 

TGGCCTGCCAGGATCGACGACATCGCGGATGGCGCCGTGAAGCCCCCACCCAACAAGTACCC 

CATCTTTTTCTTTGGCACACACGAAACAGCCTTCCTGGGACCCAAGGACCTGTTCCCCTACG 

ACAAATGTAAAGACAAGTACGGGAAGCC CAACAAGAGGAAAGGCTT CAATGAAGGGCTGTGG 

GAGAT CCAGAACAACC CC CACG C C AG CT AC AG CGC C CCTC CG CC AGTG AG CT C CTCCGACAG 

CGAGGCCCCCGAGGCCAACCCCGCCGACGG-CAGTGACGCTGACGAGGACGATGAGGACCGGG 

GGGTCATGGCCGTCACAGCGGTAACCGCCACAGCTGCCAGCGACAGGATGGAGAGCGACTCA 

GACTCAGACAAGAGTAGCGACAAC AGTGG C CT GAAG AGGAAG ACGC CTGCGCTAAAGATGTC 

GGTCTCGAAACGAGCCCGAAAGGCCTCCAGCGACCTGGATCAGGCCAGCGTGTCCCCATCCG 

AAGAGG AG AACT CGGAAAGC T CAT CTG AGT CGGAGAAG AC CAGCG ACCAGGACTT CAC AC CT 

GAGAAGAAAGCAGCGGTCCGGGCGCCACGGAGGGGCCCTCTGGGGGGACGGAAAAAAAAGAA 

GGCGCCGTCAGCCTCCGACTCCGACTCCAAGGCCGATTCGGACGGGGCCAAGCCTGAGCCGG 

TGGCCATGGCGCGGTCGGCGTCCTCCTCCTCCTCTTCCTCCTCCTCCTCCGACTCCGATGTG 

TCTGTGAAGAAG CCTC CG AGGGGCAGGAAGCCAGCGGAGAAG CCTCTCCCGAAG CCGCGAGG 

GCGGAAACCGAAGCCTGAACGGCCTCCGTCCAGCTCCAGCAGTGACAGTGACAGCGACGAGG 

TGGAC CGCAT CAGTGAGTGGAAGCGGCGGGACGAGG CG CGGAGG CG CGAG CTGGAGGC CCGG 

CGGCGG CGAGAGCAGG AGGAGGAG CTGCGG CG CCTG CGGGAG CAGGAGAAGGAGGAGAAGGA 

GCGGAGGCGCGAGCGGGCCGACCGCGGGGAGGCTGAGCGGGGCAGCGGCGGCAGCAGCGGGG 

ACGAGCTCAGGGAGGACGATGAGCCCGTCAAGAAGCGGGGACGCAAGGGCCGGGGCCGGGGT 

CCCCCGTCCTCCTCTGACTCCGAGCCCGAGGCCGAGCTGGAGAGAGAGGCCAAGAAATCAGC 

GAAGAAGCCGCAGT CCT C AAGC AC AG AG CC CGCCAGGAAAC CTGGC CAGAAGGAGAAGAGAG 

TGCGGCCCGAGGAGAAGCAACAAGCCAAGCCCGTGAAGGTGGAGCGGACCCGGAAGCGGTCC 

GAGGGCTTCTCGATGGACAGGAAGGTAGAGAAGAAGAAAGAGCCCTCCGTGGAGGAGAAGCT 

GCAGAAGCTGCACAGTGAGATCAAGTTTGCCCTAAAGGTCGACAGCCCGGACGTGAAGAGGT 

GCCTGAATGCCCTAGAGGAGCTGGGAACCCTGCAGGTGACCTCTCAGATCCTCCAGAAGAAC 

ACAG ACGTGGTGGC CAC CTTGAAGAAGATT CG CCGTTACAAAGCGAACAAGGACGT AATGG A 

GAAGGCAGCAGAAGTCTATACCCGGCTCAAGTCGCGGGTCCTCGGCCCAAAGATCGAGGCGG 

TG CAGAAAGTGAACAAGG CTGGGATGG AGAAGGAGAAGG CCG AGGAGAAG CTGGCCGGGG AG 

GAGCTGGCCGGGGAGGAGGCCCCCCAGGAGAAGGCGGAGGACAAGCCCAGCACCGATCTCTC 

AG C C CCAGTGAATGG CGAGG CCACATCACAGAAGGGGGAG AGCG CAGAGGAC AAGGAGCACG 

AGGAGGGTCGGGACTCGGAGGAGGGGCCAAGGTGTGGCTCCTCTGAAGACCTGCACGACAGC 

GTACGGGAGGGTCCCGACCTGGACAGGCCTGGGAGCGACCGGCAGGAGCGCGAGAGGGCACG 

GGGGGACTCGGAGGCCCTGGACGAGGAGAGCTGAGCCGCGGGCAGCCAGGCCCAGCCCCCGC 

CCGAGCTCAGGCTGCCCCTCTCCTTCCCCGGCTCGCAGGAGAGCAGAGCAGAGAACTGTGGG 

GAACGCTGTGCTGTTTGTATTTGTTCCCTTGGGTTTTTTTTTCCTGCCTAATTTCTGTGATT 

TC CAAC CAACATGAAATGACTAT AAACGGTTTTTTAATGA 
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FIGURE 174 



>.</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71286 
xsubunit 1 of 1, 671 aa, 1 stop 
><MW: 74317, pi: 7.61, NX(S/T): 0 

MPHAFKPGDL VFAKMKG YPHWP AR I DD I ADGAVKPP PNKYP IFF FGTHETAFLGPKDLF P YD 
KCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDSEAPEANPADGSDADEDDEDRG 
VMAVTAVTATAASDRME SDS DSDKSS DNSGLKRKTPALKMS VSKRARKAS SDLDQAS VS P SE 
EENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASDSDSKADSDGAKPEPV 
AMARSASSSSSSSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRKPKPERPPSSSSSDSDSDEV 
DRISEWKRRDEARRRELEARRRREQEEELRRLREQEKEEKERRRERADRGEAERGSGGSSGD 
ELREDDEPVKKRGRKGRGRGPPSSSDSEPEAELEREAKKSAKKPQSSSTEPARKf GQKEKRV 
RPEEKQQAKPVKVERTRKRSEGFSMDRKVEKKKE PS VEEKLQKLHSE I KFALKVDS PDVKRC 
LNALEELGTLQVTSQILQKNTDWATLKKIRRYKANKDVMEKAAEVYTRLKSRVLGPKIEAV 
QKVNKAGMEKEKAEEKLAGEELAGEEAPQEKAEDKPSTDLSAPVNGEATSQKGESAEDKEHE 
EGRDSEEGPRCGSSEDLHDSVREGPDLDRPGSDRQERERARGDSEALDEES 

Signal peptide: 

amino acids 1-13 
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FIGURE 175 



GTTGGTTCTCCTGGATCTTCACCTTACCAACTGCAGATCTTGGGACTCATCAGCCTCAATAATTATATTAAATTA 
ACACCATTTGAAAGAGAACATTGT TTTC AT C ATG AATG CTAATAAAG ATGAAAGACTT A A AflPf 1 Af3 A arer r« a a fl 

TTTTCACCTTTTTCCTGCTTTGATGATGCTAAGCATGACCATGTTGTTTCTTCCAGTCACTGGCACTTTGAAGCA 
AAATATTCCAAGACrCAAGCTAACCTACAAAGACTTGCTGCTTTCAAATAGCTGTAT 

AGAAGGACTGGATTTTCAAACTCTTCTCTTAGATGAGGAAAGAGGCAGGCTGCTCTTGGGAGCCAAAGACCAC^ 

CTTTCTACTCAGTCTGGTTGACTTAAACAAAAATTTTAAGAAGATTTATTGGCCTGCTGCAAAGGAACGGGTGGA 

ATTATGTAAATTAGCTGGGAAAGATGCCAATACAGAATGTGCAAATTTCATCAGAGTACTTCAGCCCTATAACAA 

AACTCACATATATGTGTGTGGAACTGGAGCATTTC^TCCAATATGTGGGTATATTGATCTTGGAGTCT'ACAAGGA 

GGATATTATATTCAAACTAGACACACATAATTTGGAGTCTGGCAGACTGAAATGTCCTTTCGATCCTCAGCAGCC 

TTTTGCTTCAGTAATGACAGATGAGTACCTCTACTCTTGGAACAGCTO 

CACTCGATCCCTTGGGCCTACTCATGACCACCACTACATCAGAACTGACATTTCAGAGCACTACTGGCTCAATGG 

AGCAAAATTTATTGGAACTTTCTTCATACCAGACACCTACAATCCAGATGATGATAAAATATATTTCTTCTTTCG 

TGAATCATCTCAAGAAGGCAGTACCTCCGATAAAACCATCCTTTCTCGAGTTGGAAGAGTTTGTAAGAATGATGT 

AGGAGGACAACGCAGCCTGATAAACAAGTGGACGACTTTTCTTAAGGCCAGACTGATTTGCTCAATTCCTGGAAG 

TGATGGGGCAGATACTTACTTTGATGAGCTTCAAGATATTTATTTACTCCCCACAAGAGATGAAAGAAATCCTGT 

AGTATATGGAGTCTTTACTACAACCAGCTCCATCTTCAAAGGCTCTGCTGTTTGTGTGTATAGCATGGCTGACAT 

CAGAGCAGTTTTTAATGGTCCATATGCTCATAAGGAAAGTGCAGACCATCGTTGGGTGCAGTATGATGGGAGAAT 

TCCTTATCCACGGCCTGGTACATGTCCAAGCAAAACCTATGACCCACTGATTAAGTCCACCCGAGATTTTCCAGA 

TGATGTCATCAGTTTCATAAAGCGGCACTCTGTGATGTATAAGTCCGTATACCCAGTTGCAGGAGGACCAACGTT 

CAAGAGAATCAATGTGGATTACAGACTGACACAGATAGTGGTGGATCATGTCATTGCAGAAGATGGCCAGTACGA 

TGTAATGTTTCTTGGAACAGACATTGGAACTGTCCTCAAAGTTGTCAGCATTTCAAAGGAAAAGTGGAATATGGA 

AGAGGTAGTGCTGGAGGAGTTGCAGATATTCAAGCACTCATCAATCATCTTGAACATGGAATTGTCTCTGAAGCA 

GCAACAATTGTACATTGGTTCCCGAGATGGATTAGTTCAGCTCTCCTTGCACAGATGCGACACTTATGGGAAAGC 

TTGCGCAGACTGTTGTCTTGCCAGAGACCCCTACTGTGCCTGGGATGGAAATGCATGCTCTCGATATGCTCCTAC 

TTCTAAAAGGAGAGCTAGACGCCAAGATGTAAAATATGGCGACCCAATCACCCAGTGCTGGGACATCGAAGACAG 

CATTAGTCATGAAACTGCTGATGAAAAGGTGATTTTTGGCATTGAATTTAACTCAACCTTTCTGGAATGTATACC 

TAAATCCCAACAAGCAACTATTAAATGGTATATCCAGAGGTCAGGGGATGAGCATCGAGAGGAGTTGAAGCCCGA 

TGAAAGAATCATCAAAACGGAATATGGGCTACTGATTCGAAGTTTGCT^GAAGAAGGATTCTGGGATGTATTACTG 

CAAAGCCCAGGAGCACACTTTCATCCACACCATAGTGAAGCTGACTTTGAATGTCATTGAGAATGAACAGATGGA 

AAATACCCAGAGGGCAGAGCATGAGGAGGGGCAGGTCAAGGATCTATTGGCTGAGTCACGGTTGAGATACAAAGA 

CTACATCCAAATCCTTAGCAGCCCAAACTTCAGCCrrCGACCAGTACTGCGAACAGATGTGGCAC^ 

GAGACAGAGAAACAAGGGGGGCCCAAAGTGGAAGCACATGCAGGAAATGAAGAAGAAACGAAATCGAAGACATCA 

CAGAGACCTGGATGAGCTCCCTAGAGCTGTAGCCACGTAGTTTTCTACTTAATTTAAAGAAAAGAATTCCTTACC 



CACAAGACAATAATCTGAATAAGACAATATGTGATGAATATAAGAAAGGGCAAAAAATTCATTTGAACCAGTT 

CCAAGAACAAATC TTGC AC AAGCAAAGTATAAGAATTATC CTAAAAATAGGGGGTTTACAG TTGTAAATGTTTTA 

TGTTTTGAGTTTTGGAATTTATTGTCATGTAAATAGTTGAGCTAAGCAAGCCCCGAATTTGATAGTGTATAAGGT 

GCTTTATTCCCTCGAATGTCCATTAAGCATGGAATTTACCATGCAGTTGTGCTATGTTCTTATGAACAGATATAT 

CATTCCTATTGAGAACCAGCTACCTTGTGGTAGGGAATAAGAGGTCAGACACAAATTAAGACAACTCCCATTATC 

AACAGGAACTTTCTCAGTGAGCCATTCACTCCTGGAGAATGGTATAGGAATTTGGAGAGGTGCATTATTTCTT 

TGGCCACTGGGGTTAAATTTAGTGTACTACAACATTGATTTACTGAAGGGCACTAATGTTTCCCCCAGGATTTCT 

ATTGACTAGTCAGGAGTAACAGGTTCACAGAGAGAAGTTGGTGCTTAGTTATGTGTTTTTTAGAGTATATACTAA 

GCTCTACAGGGACAGAATGCTTAATAAATACTTTAATAAGATATGGGAAAATATTTTAATAAAACAAGGAAAACA 

TAATGATGTATAATGCATCCTGATGGGAAGGCATGCAGATGGGATTTGTTAGAAGACAGAAGGAAAGACAGCCAT 

AAATTCTGGCTTTGGGGAAAACTCATATCCCCATGAAAAGGAAGAACAATCACAAATAAAGTGAGAGTAATGTAA 

TGGAGCTCTTTTC^CTAGGGTATAAGTAGCTGCC^^TTTGTAATTCATCTGTTAAAAAAAATCTAGATTATAA^ 

AACTGCTAGCAAAATCTGAGGAAACATAAATTCTTCTGAAGAATCATAGGAAGAGTAGACATTTTATTTATAACC 

AATGATATTTCAGTATATATTTTCTCTCTTTTAAAAAATATTTATCATACTCTGTATATTATTTOTTTTTACTGC 

CTTTATTCTCTCCTGTATATTGGATTTTGTGATTATATTTGAGTGAATAGGAGAAAACAATATATAACACACAGA 

GAATTAAGAAAATGACATTTCTGGGGAGTGGGGATATATATTTGTTGAATAACAGAACGAGTGTAAAATTTTAAC 

AACGGAAAGGGTTAAATTAACTCTTTGACATCTTCACTGAACC 

GTAGTATTGTTTTTGTAATTTAACAATAAATAAGCCTGCTACATGT 
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FIGURE 176 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA71883 
xsubunit 1 of 1, 777 aa, 1 stop 
><MW: 89651, pi: 7.97, NX (S/T) : 3 

^ANKDERLKARSQDFHLFPALMMLSMTMLFLPVTGTLKQNIPRLKLTYKDLLLSNSCIPFL 
GSSEGLDFQTLLLDEERGRLLLGAKDHIFLLSLVDLNKNFKKIYWPAAKERVELCKLAGKDA 
NTECANF IRVLQPYNKTH I YVCGTGAFHP I CGY I DLGVYKED 1 1 FKLDTHNLESGRLKCPFD 
PQQPFASVMTDEYLYSGTASDFLGKDTAFTRSLGPTHDHHYIRTDISEHYWLNGAKFIGTFF 
IPDTYNPDDDKIYFFFRESSQEGSTSDKTILSRVGRVCKNDVGGQRSLINKWTTFLKARLIC 
S I PGSDGADTYFDELQDI YLLPTRDERNPWYGVFTTTSS I FKGSAVCVYSMAD IRAVFNGP 
YAHKES ADHRWVQ YDGR I PYPRPGTCPSKTYDPL I KSTRDFPDDVI S F I KRHSVMYKS VYPV 
AGGPTFKRINVDYRLTQ I WDHVIAEDGQYDVMFLGTD IGTVLKWS I SKEKWNMEEWLEE 
LQIFKHSSIILNMELSLKQQQLYIGSRDGLVQLSLHRCDTYGKACADCCLARDPYCAWDGNA 
CSRYAPTSKRRARRQDVKYGDP ITQCWD IEDS I SHETADEKVI FG I EFNSTFLE CI PKSQQA 
TIKWYIQRSGDEHREELKPDERIIKTEYGLLIRSLQKKDSGMYYCKAQEHTFIHTIVKLTLN 
VI ENEQMENTQRAEHEEGQVKDLLAE SRLR YKD Y I Q I LS S PNFS LDQ YCEQMWHREKRRQRN 
KGGPKWKHMQEMKKKRNRRHHRDLDELPRAVAT 

Important features of the protein: 
Signal peptide: 

amino acids 1-36 

N-glycosylation sites. 

amino acids 139-142, 607-610, 724-727 

Tyrosine kinase phosphorylation site. 

amino acids 571-576 

Gram- positive cocci surface proteins 'anchoring' hexapeptide. 

amino acids 32-3 7 
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FIGURE 177 



CCCTGACCTCCCTGAGCCACACTGAGCTGGAAGCCGCAGAGGTCATCCTGGAGCATGCCCACCGCGGGGAGCAGA 
CAACCTCCCAGGTAAGCTGGGAGCAAGACCTGAAGCTGTTTCTTCAGGAGCCTGGTGTATTTTCCCCCACCCCAC 
CTCAGCAGTTTCAGCCAGCAGGGACTGATCAGGTGTGTGTCCTGGAGTGGGGAGCAGAAGGCGTGGCTGGCAAGA 
GTGGCCTGGAGAAAGAGGTTCAGCGCTTGACCAGCCGAGCTGCCCGTGACTACAAGATCCAGAACCATGGGCATC 
GGGTGAGGTGGGGGGGCACAGGTGTCATGTGCACCTTCTTGTCTCAGCAAGAAGAGCTGAGAGAGGGGATCTTGG 
AGCCATTGAGGGTGTCATGGAGCTACAGAGGGGAGGGAAAGGTATTTTAAGGTAACAGTGTGGCACAATAGTTAA 
GAGCACAGTTTTTGGAGCTAGACCGACATAGGTTCAAATTCTCTTCTGTTGCTTCCTAGTTCTGTAGCCCCAGGT 
AAGGGAGTGACTTAACCTCTCTGGACTTCAATTTCCTCATCACTAAAGTAGGGCCAATAATAGCACCCACCTCAT 
AGGGAAGATTAAATGACATAATGTATGTGATGCAACTAGCAAAGTACCAGTCCCATAGTAAG TCATGCC CCACAG 
TATTTCCACCCACCCCTGTTCTCTGCCTTCCCAACCAGGTACTGCAACGACTGGAGCAGAGGCGGCAGCAGGCTT 
CAGAGCGGGAGGCTCCAAGCATAGAACAGAGGTTACAGGAAGTGCGAGAGAGCATCCGCCGGGCACAGGTGAGCC 
AGGTGAAGGGGGCTGCCCGGCTGGCCCTGCTGCAGGGGGCTGGCTTAGATGTGGAGCGCTGGCTGAAGCCAGCCA 

CAACCGCTGAGGATGCTGAGCTTTCTGACTTTGAGGAATGTGAGGAGACGGGAGAGCTCTTTGAGGAGCCTGCCC 
CCCAAGCCCTGGCCACGAGGGCCCTCCCCTGCCCTGCACACGTGGTATTTCGCTATCAGGCAGGGCGTGAGGATG 
AGCTGACAATCACGGAGGGTGAGTGGCTGGAGGTCATAGAGGAGGGAGATGCTGACGAATGGGTCAAGGCTCGGA 
ACCAGCACGGCGAGGTAGGCTTTGTCCCTGAGCGATATCTCAACTTCCCGGACCTCTCCCTCCCAGAGAGCAGCC 
AAGACAGTGACAATCCCTGCGGGGCAGAGCCCACAGCATTCCTGGCACAGGCCCTGTACAGCTACACCGGACAGA 
GTGCAGAGGAGCTGAGCTTCCCTGAGGGGGCACTCATCCGTCTGCTGCCCCGGGCCCAAGATGGAGTAGATGACG 
GCTTCTGGAGGGGAGAATTTGGGGGCCGTGTTGGGGTCTTCCCCTCCCTGCTGGTGGAAGAGCTGCTTGGCCCCC 
CAGGGCCACCTGAACTCTCTGACCCTGAACAGATGCTGCCGTCCCCTTCTCCTCCCAGCTTCTCCCCACCTGCAC 
CTACCTCTGTGTTGGATGGGCCCCCTGCACCTGTCCTGCCTGGGGACAAAGCCCTGGACTTCCCTGGGTTCCTGG 
ACATGATGGCACCTCGACTCAGGCCGATGCGTCCACCACCTCCCCCGCCGGCTAAAGCCCCGGATCCTGGCCACC 
CAGATC CC CTCACCTGAAGGCCAGGG AAG C CTTGAC C C CCAGTGATGCTGC TGTCCCTATCTTCAAG CTGTCAG A 
CCAC^CCATCAATGATCCAGAGCAAGACAGCCAAAAGCTG^^ 

GGAAACTTGCCCCTTCCCATTTCTAGAGCTGGAACCCACTCCTTTTTTTCCCATTGTTCTATCATCTCTAGGACC 

GGAACTACTACCTTCTCTTCTGTCATGACCCTATCTAGGGTGGTGAAATGCCTGAAATCTCTGGGGCTGGAAACC 

ATCCATCAAGGTCTCTAGTAGTTCTGGCCCACCTCTTTCCCCACCCTGGCTCCATGACCCACCCCACTCTGGATG 

CCAGGGTCACTGGGGTTGGGCTGGGGAGAGGAACAGGCCTTGGGAATCAGGAGCTGGAGCCAGGATGCGAAGCAG 

CTGTAATGGTCTGAGCGGATTTATTGACAATGAATAAAGGGCACGAAGGCCAGGCCAGGGCCTGGGCCTCTTGTG 

CTAAGAGGGCAGGGGGCCTACGGTGCTATTGCTTTAGGGGCCCA 

GCTCTATCATATGGAGCGAGGTGTTGGGGAAGGCGGGGCAGGGAGCCT^ 

TGAGGGGCTGTGACCTCTCCTGAGGCCCCCAGCCTGAGACTGTGCAACTCCAGGTGGAAGTAGAGCTGGTCCCTC 

AGCTGGGGGGCAGTGCTGTCCAGTGGAGGGGAGGGCTTTCACGCCCACCCACCCCCTGGCCCTGCCAGCTGGTAG 

TCCATCAGCACAATGAAGGAGACTTGGAGAAGAGGAAGAATAA(^CTGTTGCTTCCTGTTCAAGCTGTGTCC 

TTTTCCCCTGGGGCTCCAGGACCTTCCCTACCTCCACCACCAAACCAAGGGATTTATAGCAAAGGCTAAGCCTGC 

AGTTTACTCTGGGGGTTCAGGGAGCCGAAAGGCTTAAATAGTTTAAGTAGGTGATGGGAAGATGAGATTACCTCA 

TTTAGGGCTCAGGCAGACTCACCTCACATACTCCCTGCTCCCTGTGGTAGAGACACCTGAGAGAAAGGGGAGG^ 

TCAACAATGAGAGACCAGGAGTAGGTCCTATCAGTGCCCCCCAGAGTAGAGAGCAATAAGAGCCCAGCCCAGTGC 

AGTCCCGGCTGTGTTTTCCTACCTGGTGATCAGAAGTGTCTGGTTTGCTTGGCTGCCCATTTGCCTCTTGAGTGG 

GCAGCCCTGGGCTTGGGCCCCTCCCTCCGGCCCTCAGTGTTGGCTCTGCAGAAGCTCTGGGGTTCCCTTCAAGTG 

CACGAGGGGTTAGGCTGCTGTCCCTGAGTCCTCCATTCTGTAOTGGGGGGCTGGCTAGGACCTGGGGCTGTGGCC 

TCTCAGGGGGCAGCCTCTCCATGGCAGGCATCCCTGCCTTGGGCTGCCCTCCCCCAGACCCCTGACCACCCCCTG 

GGTCCTGTCCCCCACCAGAGCCCCAGCTCCTGTCTGTGGGGGAGCCATCACGGTGTTCGTGCAGTCCATAGCGCT 

TCTCAATGTGTGTCACCCGGAACCTGGGAGGGGAGGGAACACTGGGGTTTAGGACCACAACTCAGAGGCTGCTTG 

GCCCTCCCCTCTGACCAGGGACATCCTGAGTTTGGTGGCTACTTCCCTCTGGCCTAAGGTAGGGGAGGCCTTCTC 

AGATTGTGGGGCACATTGTGTAGCCTGACTTCTGCTGGAGCTCCCAGTCCAGGAGGAAAGAGCCAAGGCCCACTT 

TTGGGATCAGGTGCCTGATCACTGGGCCCCCTACCTCAGCCCCCCTTTCCCTGGAGCACCTGCCCCACCTGCCCA 

CAGAGAACACAGTGGTCTCCCCTGTCCGGGGGCGGCTTTTTCCTTCCTTGGAGCGTCCCTGACGGACAAGTGGAG 

GCCTCTTGCTGCGGCTGCAATGGATGCAAGGGGCTGCAGAGCCCAGGTGCACTGTGTGATGATGGGAGGGGGCTC 

CGTCCTGCAGGCTGGAGGTGGCATCCACACTGGACAGCAGGAGGAGGGGAGTGAGGGTAACATTTCCATTTCCCT 

TCATGTTTTGTTTCTTACGTTCTTTCAGCATGCTCCTTAAAACCCCAGAAGCCCCAATTTCCCCAAGCCCCATTT 

TTTCTTGTCTTTATCTAATAAACTCAATATTAAG 
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FIGURE 178 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73401 
xsubunit 1 of 1, 3 70 aa, l stop 
><MW: 40685, pi: 4.53, NX(S/T) : 0 

MQLAKYQSHSKSCPTVFPPTPVLCLPNQVLQRLEQRRQQASEREAPSIEQRLQEVRESIRRA 
QVSQVKGAARLALLQGAGLDVERWLKPAMTQAQDEVEQERRLSEARLSQRDLSPTAEDAELS 
DPEECEETGELFEEPAPQALATRALPCPAHWFRYQAGREDELTITEGEWLEVIEEGDADEW 
VKARNQHGEVGFVPERYLNFPDLSLPESSQDSDNPCGAEPTAFLAQALYSYTGQSAEELSFP 
EGALIRLLPRAQDGVDDGFWRGEFGGRVGVFPSLLVEELLGPPGPPELSDPEQMLPSPSPPS 
FSPPAPTSVLDGPPAPVLPGDKALDFPGFLDMMAPRLRPMRPPPPPPAKAPDPGHPDPLT 
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FIGURE 179 A 

CAC^GGGAGACCCACAGACAC^TATGCACGAGAGAGACAGAGGAGGAAAGAGACAGAGACAAAGGCACAGCGGAA 
GAAGG CAGAGACAGGGC AGGC AC AGAAG CGG CCCAGACAGAGTCCTACAGAGGGAGAGG CCAGAG AAGCTG CAGA 
AGACACAGGCAGGGAGAGACAAAGATCCAGGAAAGGAGGGCTCAGGAGGAGAGTTTGGAGAAGCCAGACCCCTGG 
GCACCTCTCCCAAGCCCAAGGACTAAGTTTTCTCCATTTCCTTTAACGGTCCTCAGCCCTTCTGAAAACTTTGCC 
TCTGACCTTGGCAGGAGTCCAAGCCCCCAGGCTACAGAGAGGAGCTTTCCAAAGCTAGGGTGTGGAGGACTTGGT 
GCCCTAGACGGCCTCAGTCCCTCCCAGCTGCAGTACCAGTGCCATGTCCCAGACAGGCTCGCATCCCGGGAGGGG 
CTTGGCAGGGCGCTGGCTGTGGGGAGCCCAACCCTGCCTCCTGCTCCCCATTGTGCCGCTCTCCTGGCTGGTGTG 
GCTGCTTCTGCTACTGCTGGCCTCTCTCCTGCCCTCAGCCCGGCTGGCCAGCCCCCTCCCCCGGGAGGAGGAGAT 
CGTGTTTCCAGAGAAGCTCAACGGCAGCGTCCTGCCTGGCTCGGGCGCCCCTGCCAGGCTGTTGTGCCGCTTGCA 
GGCCTTTGGGGAGACGCTGCTACTAGAGCTGGAGCAGGACTCCGGTGTGCAGGTCGAGGGGCTGACAGTGCAGTA 
CCTGGGCCAGGCGCCTGAGCTGCTGGGTGGAGCAGAGCCTGGCACCTACCTGACTGGCACCATCAATGGAGATCC 
GGAGTCGGTGGCATCTCTGCACTGGGATGGGGGAGCCCTGTTAGGCGTGTTACAATATCGGGGGGCTGAACTCCA 
CCTCCAGCCCCTGGAGGGAGGCACCCCTAACTCTGCTGGGGGACCTGGGGCTCACATCCTACGCCGGAAGAGTCC 
TGCCAGCGGTCAAGGTCCCATGTGCAACGTCAAGGCTCCTCTTGGAAGCCCCAGCCCCAGACCCCGAAGAGCCAA 
GCGCTTTGCTTCACTGAGTAGATTTGTGGAGACACTGGTGGTGGCAGATGACAAGATGGCCGCATTCCACGGTGC 
GGGGCTAAAGCGCTACCTGCTAACAGTGATGGCAGCAGCAGCCAAGGCCTTCAAGCACCCAAGCATCCGCAATCC 
TGTCAGCTTGGTGGTGACTCGGCTAGTGATCCTGGGGTCAGGCGAGGAGGGGCCCCAAGTGGGGCCCAGTGCTGC 
CCAGACCCTGCGCAGCTTCTGTGCCTGGCAGCGGGGCCTCAACACCCCTGAGGACTCGGGCCCTGACCACTTTGA 
CACAGCCATTCTGTTTACCCGTCAGGACCTGTGTGGAGTCTCCACTTGCGACACGCTGGGTATGGCTGATGTGGG 
CACCGTCTGTGACCCGGCTCGGAGCTGTGCCATTGTGGAGGATGATGGGCTCCAGTCAGCCTTCACTGCTGCTCA 
TG AACTGGGTCATGTCTT CAACATG CT C CATGACAAC T C CAAG C CATGCATCAGTTTGAATGGGCC TTTGAG CAC 

CTCTCGCCATGTCATGGCCCCTGTGATGGCTCATGTGGATCCTGAGGAGCCCTGGTCCCCCTGCAGTGCCCGCTT 
CATCACTGACTTCCTGGACAATGGCTATGGGCACTGTCTCTTAGACAAACCAGAGGCTCCATTGCATCTGCCTGT 
GACTTTCCCTGGCAAGGACTATGATGCTGACCGCCAGTGCCAGCTGACCTTCGGGCCCGACTCACGCCATTGTCC 
ACAGCTTGCCGCCGCCCTGTGCTGCCCTCTGGTGCTCTGGCCACCTCAATGGCCATGCCATGTGCCAGACCAAACA 
CTCGCCCTGGGCCGATGGCACACCCTGCGGGCCCGCACAGGCCTGCATGGGTGGTCGCTGCCTCCACATGGACCA 
GCTCCAGGACTTC^ATATTCCACAGGCTGGTGGCTGGGGTCCTTGGGGACCATGGGGTGACTGCTOTCGGACCTG 
TGGGGGTGGTGTCCAGTTCTCCT C C CGAGACTG CACGAGGCCTGTCCCCCGGAATGGTGG CAAGTACTGTGAGGG 

CCGCCGTACCCGCTTCCGCTCCTGCAACACTGAGGACTGCCCAACTGGCTCAGCCCTGACCTTCCGCGAGGAGCA 

GTGTGCTGCCTACAACCACCGCACCGACCTCTTCAAGAGCTTCCCAGGGCCCATGGACTGGGTTCCTCGCTACAC 

AGGCGTGGCCCCCCAGGACCAGTGCAAACTCACCTGCCAGGCCCGGGCACTGGGCTACTACTATGTGCTGGAGCC 

ACGGGTGGTAGATGGGACCCCCTGTTCCCCGGACAGCTCCTCGGTCTGTGTCCAGGGCCGATGCATCCATGCTGG 

CTGTGATCGCATCATTGGCTCCAAGAAGAAGTTTGACAAGTGCATGGTGTGCGGAGGGGACGGTTCTGGTTGCAG 

CAAGCAGTCAGGCTCCTTCAGGAAATTC^GGTACGGATACAACAATGTGGTCACTATCCCCGCGGGGGCCACCCA 

CATTCTTGTCCGGCAGCAGGGAAACCCTGGCCACCGGAGCATCTACTTGGCCCTGAAGCTGCCAGATGGCTCCTA 

TGCCCTCAATGGTGAATACACGCTGATGCCCTCCCCCACAGATGTGGTACTGCCTGGGGCAGTCAGCTTGCGCTA 

CAGCGGGGCCACTGCAGCCTCAGAGACACTGTCAGGCCATGGGCC^CTGGCCCAGCCTTTGACACT^ 

AGTGGCTGGC^WVCCCCCAGGACACACGCCTCCGATACAGCTTCTTCGTGCCCCGGCCGACCCCTTCAACGCCACG 

CCCCACTCCCCAGGACTGGCTGCACCGAAGAGCACAGATTCTGGAGATCCTTCGGCGGCGCCCCTGGGCGGGCAG 

GAAATAACCTCACTATCCCGGCTGCCCTTTCTGGGCACCGGGGCCTCGGACTTAGCTGGGAGAAAGAGAGAGCTT 

CTGTTGCTGCCTCATGCTAAGACTCAGTGGGGAGGGGCTGTGGGCGTGAGACCTGCCCCTCCTCTCTGCCCTAAT 

GCGCAGGCTGGCCCTGCCCTGGTTTCCTGCCCTGGGAGGCAGTGATGGGTTAGTGGATGGAAGGGGCTGACAGAC 

AGCCCTCCATCT7UVACTGCCCCCTCTGCCCTGCGGGTCACAGGAGGGAGGGGGAAGGCAGGGAGGGCCTGGGCCC 
CAGTTGTATTTATTTAGTATTTATC 

CCTGACCCCTGACCCCTCATAGCCCTCACCCTGGGGCTAGGAAATCCAGGGTGGTGGTGATAGGTATAAGTGGTG 

TGTGTATG CGTG TGTGTGTGTGTGTGAAAATGTGTGTGTGCTTATGTATGAGGTACAACCTGTTCTGCTTTCCTC 
TTCCTGAATTTTATTTTTTC^ 

TTTTTTTTTCTTTCTTTCTTTCTTTTTTTTTTTTGAGACAGAATCTCGCTCTGTCGCCCAGGCTGGAGTGCAATG 
GCACAATCTCGGCTCACTGCATCCTCCGCCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCTGAGTAGCTG 
GGATTACAGGCTCCTGCCACCACGCCCAGCTAATTTTTGTTTTGTTTTGTTTGGAGACAGAGTCTCGCTATTGTC 
ACCAGGGCTGGAATGATTTCAGCTCACTGCAACCTTCGCCACCTGGGTTCCAGCAATTCTCCTGCCTCAGCCTCC 
CGAGTAGCTGAGATTATAGGCACCTACCACCACGCCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCAC 
CATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTTAGGTGATCCACTCGCCTTCATCTCCCAAAGTGCTGGGATT 
ACAGGCGTGAGCCACCGTGCCTGGCCACGCCCAACTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGT 
TGGCCAGGCTGCTCTTGAACTCCTGACCTCAGGTAATCGACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGG 
TGTGAGCCACCACGCCCGGTACATATTTTTTAAATTGAATTCTACTATTTATGTGATCCTTTTGGAGTCAGACAG 
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FIGURE 179B 

ATGTGGTTGCATCCTAACTCCATGTCTCTGAGCATTAGATTTCTCATTTGCCAATAATAATACCTCCCTTAGAAG 
TTTGTTGTGAGGATTAAATAATGTAAATAAAGAACTAGCATAA(^CTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 




WO 00/12708 



PCT/US99/20111 



FIGURE 180 



>< /usr/seqdb2 /ss t /DNA/Dnaseqs . rain/ss . DNA73 4 92 
xsuburiit 1 of 1, 837 aa, 1 stop 
><MW: 90167, pi: 8.39, NX(S/T): 1 

MSQTGSHPGRGLAGRWLWGAQPCLLLPIVPLSWLVWLLLLLLASLLPSARLASPLPREEEIV 

FPEKLNGSVLPGSGAPARLLCRLQAFGETLLLELEQDSGVQVEGLTVQYLGQAPELLGGAEP 

GTYLTGTINGDPESVASLHWDGGALLGVLQYRGAELHLQPLEGGTPNSAGGPGAHILRRKSP 

ASGQGPMCNVKAPLGSPSPRPRRAKRFA5LSRFVETLVVADDKMAAFHGAGLICRYLLTVMAA 

AAKAFKHPSIRNPVSLWTRLVILGSGEEGPQVGPSAAQTLRSFCAWQRGLNTPEDSGPDHF 

DTAILFTRQDLCGVSTCDTLGMADVGTVCDPARSCAIVEDDGLQSAFTAAHELGHVFNMLHD 

NSKPCISLNGPLSTSRHVMAPVMAHVDPEEPWSPCSARFITDFLDNGYGHCLLDKPEAPLHL 

PVTFPGKDYDADRQCQLTFGPDSRHCPQLPPPCAALWCSGHLNGHAMCQTKHSPWADGTPCG 

PAQACMGGRCLHMDQLQDFNIPQAGGWGPWGPWGDCSRTCGGGVQFSSRDCTRPVPRNGGKY 

CEGRRTRFRSCNTEDCPTGSALTFREEQCAAYNHRTDLFKSFPGPMDWVPRYTGVAPQDQCK 

LTCQARALGYYYVLEPRWDGTPCSPDSSSVCVQGRCIHAGCDRIIGSKKKFDKCMVCGGDG 

SGCSKQSGSFRKFRYGYNNWTIPAGATHILVRQQGNPGHRSIYLALKLPDGSYALNGEYTL 

MPSPTDWLPGAVSLRYSGATAASETLSGHGPLAQPLTLQVLVAGNPQDTRLRYSFFVPRPT 
PSTPRPTPQDWLHRRAQ I LEI LRRRPWAGRK 

Important features of the protein: 
Signal peptide: 

amino acids 1-48 

N-glycosylation site. 

amino acids 68-71 

Glycosaminoglycan attachment site 

amino acids 188-191, 772-775 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 182-185 

Tyrosine kinase phosphorylation site. 

amino acids 730-736 

N-myristoylation sites. 



amino acids 5-10, 19-24, 121-126, 125-130, 130-135, 147-152, 167- 
172, 168-173, 174-179, 323-328, 352-357, 539-544, 555-560, 577- 
582, 679-684, 682-687, 763-768 



Amidation sites. 

amino acids 560-563, 834-837 
Leucine zipper pattern, 
amino acids 17-38, 24-45 



zinc metal lopeptidases, zinc -binding region signature. 



amino acids 358-367 



/S3/270 
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FIGURE 181 



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCAIGG 
CAAAGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCC 
AAGAAAATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCT 
AATTGTCCTGTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACA 
TGGAG CACACTTT CTACAGC AATGGAGAG AAG AAGAAGAT TTACATGG AAAT TG ATCCTGTG 
ACCAGAACTGAAATATTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTT 
TAAAAACGGATACACTGGCATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGA 
TT AAAGTG ATT C CTGAATTT TC TGAACC AGAAGAGG AAAT AGATGAGAATGAAGAAATTAC C 
ACAACTTT CTTTG AAC AGT C AGTGAT TTGGGT C C C AGCAGAAAAGCCTATTGAAAACCGAGA 
TTTTCTTAAAAATT CC AAAATT CTGGAGATTTGTGATAACGTGACCATGTATTGGATCAATC 
C CACT CTAAT AT C AGTTT CTGAGTT AC AAGACTTTGAGGAGGAG GGAGAAGAT CTT CACTTT 
CCTGCCAACGAAAAAAAAGGGATTGAACAAAATGAACAGTGGGTGGTCCCTCAAGTGAAAGT 
AG AGAAGACCCGTCACGC CAGACAAGCAAGTGAGGAAGAACTTC CAAT AAATGACT ATACTG 
AAAATGGAATAGAATTTGATCCCATGCTGGATGAGAGAGGTTATTGTTGTATTTACTGCCGT 
CGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTAGGCTACTACCCATATCCATA 
CTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTGTAACTGGTGGGTGGCCC 
GCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAACTGCTGGCAACATAT 
AATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCTGGTAGCCAGCT 
CTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACATTATCACC 
AAAAAAAAAAAAAAAAAAA 



# 
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FIGURE 182 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73727 
xsubunit 1 of 1, 317 aa, 1 stop 
><MW: 37130, pi: 5.18, NX(S/T): 3 

MAKNP PENCEDCH I LNAEAFKS KK I CKS LK I CGLVFG I LALTLIVLFWGSKHFWPEVPKKAY 
DMEHTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKT 
QIKVIPEFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWI 
NPTLISVSELQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDY 
TENG I E FD PMLDERGY CC I Y CRRGNR YCRRVCEPLLGYYP YP YCYQGGRV I CRV IMPCNWWV 
ARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-4 0 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme -binding site signature. 

amino acids 9-14 
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FIGURE 183 

GCGGAACTGGCTCCGGCTGGCACCTGAGGAGCGGCGTGACCCCGAGGGCCCAGGGAGCTGCC 

CGGCTGGCCTAGGCAGGCAGCCGGACCATGGCCAGCACGGCCGTGCAGCTTCTGGGCTTCCT 

GCTCAGCTTCCTGGGCATGGTGGGCACGTTGATCACCACCATCCTGCCGCACTGGCGGAGGA 

CAGCGCACGTGGGCACCAACATCCTCACGGCCGTGTCCTACCTGAAAGGGCTCTGGATGGAG 

TGTGTGTGGCACAGCACAGGCATCTACCAGTGCCAGATCTACCGATCCCTGCTGGCGCTGCC 

CCAAGACCTCCAGGCTGCCCGCGCCCTCATGGTCATCTCCTGCCTGCTCTCGGGCATAGCCT 

GCGCCTGCGCCGTCATCGGGATGAAGTGCACGCGCTGCGCCAAGGGCACACCCGCCAAGACC 

ACCTTTGCCATCCTCGGCGGCACCCTCTTCATCCTGGCCGGCCTCCTGTGCATGGTGGCCGT 

CTCCTGGACCACCAACGACGTGGTGCAGAACTTCTACAACCCGCTGCTGCCCAGCGGCATGA 

AGTTTGAGATTGGCCAGGCCCTGTACCTGGGCTTCATCTCCTCGTCCCTCTCGCTCATTGGT 

GGCACCCTGCTTTGCCTGTCCTGCCAGGACGAGGCACCCTACAGGCCCTACCAGGCCCCGCC 

CAGGGCCACCACGACCACTGCAAACACCGCACCTGCCTACCAGCCACCAGCTGCCTACAAAG 

ACAATCGGGCCCCCTCAGTGACCTCGGCCACGCACAGCGGGTACAGGCTGAACGACTAGGTG 

IG&GTCCCCACAGCCTGCTTCTCCCCTGGGCTGCTGTGGGCTGGGTCCCCGGCGGGACTGTC 

AATGGAGGCAGGGGTTCCAGCACAAAGTTTACTTCTGGGCAATTTTTGTATCCAAGGAAATA 

ATGTGAATGCGAGGAAATGTCTTTAGAGCACAGGGACAGAGGGGGAAATAAGAGGAGGAGAA 

AGCTCTCTATACCAAAGACTGAAAAAAAAAATCCTGTCTGTTTTTGTATTTATTATATATAT 

TTATGTGGGTGATTTGATAACAAGTTTAATATAAAGTGACTTGGGAGTTTGGTCAGTGGGGT 

TGGTTTGTGATCCAGGAATAAACCTTGCGGATGTGGCTGTTTATGAAAAAAAAAAAAA 
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FIGURE 184 



MASTAVQLLGFLLSFLGMVGTLITTILPHWRRTAHVGTNILTAVSYLKGLWMECVWHSTGIY 
QCQIYRSLLALPQDLQAARALMVISCL^ 

FILAGLLCMVAVSWTTNDVVQNFYNPLLPSGMKFEIGQALYLGFISSSLSLIGGTLLCLSCQ 
DEAPYRPYQAPPRATTTTANTAPAYQPPAAYKDNRAPSVTSATHSGYRLNDYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domains: 

amino acids 82-103, 115-141, 160-182 
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FIGURE 185 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCG 
CAGGCGGCAGGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCT 
CCTGTCCATCCTGGGGCTGGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCC 
AGGACCTGTACGACAACCCCGTCACCTCCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGC 
GTGAGGCAGAGTTCAGGCTT CACCGAATGCAGGC CCTATTTCACCATCCTGGGACTTCCAGC 
CATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCTGGGTGCCATTGGCCTCC 
TGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTGCCAAAGCC 
AACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGT 
GTCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCG 
GCATGGGTGGGATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTG 
GGCTGGGTCGCTGGAGGCCTCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGG 
CCTGGCAC CAGAAGAAAC CAACTACAAAGCCGTTT CTTATCATG C CTCAGG C CACAGTGTTG 
CCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTTTGGGTCCAACACCAAAAACAAGAAG 
AT ATACGATGGAGGTG C CCG CACAGAGGACGAGGTACAAT CTTATCCTTC CAAGCACG ACTA 

TGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCACCCAAAA 
AACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGC 
CTCGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCC 
AC CATAAAAC AG CTGAGTTATTTATGAATTAGAGGCTATAGCT C ACATTTT CAAT C CT CTAT 
TTCTTTTTTTAAATATAACTTTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCAC 
ATTTTGATGATTTAGACAGACTCCCC CTCTTC CT CCTAGT CAATAAAC C C ATTG ATGATC TA 
TTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAAGAGTAGACCCAAAGATGTTATTTT 
CTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAAACACTTACTGAAGAA 
GAAGCAATAAGAGAAAGATATTTGTAAT CT CTCCAG CC CATGAT CTCGGTTTTCTTACACTG 
TGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTG 
CTGTTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAG 

TTAAGTCCTAAATATAGTTAAAATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGA 
AATAGCCT CACC C CTA CATGTGGATAGAAGGAAATG AAAAAATAATTG CTTTGACATTGT CT 
ATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCCATGAAAAGCTCACACCTGTAATC 
CTAG CACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGTT CGAGACTAGCCTG 
GGCAACATGGAGAAGC CC TGTCTCTACAAAATACAGAGAGAAAAAATCAG C CAGTC ATGGTG 
GCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGG 
AGGTTGGGGCTG CAGTGAGC CATGAT CACACC ACTGCACT CCAGCC AGGTGACATAGCGAGA 
TC CTGT CT AAAAAAATAAAAAATAAATAATGGAACACAG C AAGTCCTAGG AAGTAGGTTAAA 
ACTAATTCTTTAA 
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FIGURE 186 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73 734 
xsubunit 1 of 1, 261 aa, 1 stop 
><MW: 27856, pi: 8.50, NX(S/T): 1 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTE 

CRPYFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKA^TLTSGIM 

FIVSGLCAIAGVSVFANMLVTNFWMSTAJSnVIYTGMGGMVQWQTRYTFGAALFVG 

IGGVMMCIACRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTE 
DEVQSYPSKHDYV 

Signal peptide: 

amino acids 1-23 

Transmembrane domains : 

amino acids 81-100, 121-141, 173-194 



# 
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FIGURE 187 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCG 
GAGTCCAGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTG 
GGCTGTTTCTTGGTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGG 
AGAGTGTCGGCCTTCATTGAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTG 
GATGAATTGCGTGAGGCAGGCTAACATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGG 
CTCTTTCTCCGGACCTACAGGCAGCCAGAGGACTGATGTGTGCTGCTTCCGTGATGTCCTTC 
TTGGCTTTCATGATGGCCATCCTTGGCATGAAATGCACCAGGTGCACGGGGGACAATGAGAA 
GGTGAAGG CT CACATT CTGCTG ACGG CTGGAATC AT CTTCAT CATC ACGGGCATGGTGGTGC 

TCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATAACTCAATAGTGAAT 
GTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCACTGGTGCT 
GATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTACA 
GATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCG 
AGCGTCTACTCCAGAAGTCAGTATGTGT&STTGTGTATGTTTTTTTAACTTTACTATAAAGC 
CATGCAAATGACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTA 
CTGTTCTTAACTGCCTAATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAA 
GCTATTTCAGCAGAATGAGATATTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAAT 
TTGTTTTCTAAGGTGGTTCAAGCATCTACTCTTTTTATCATTTACTTCAAAATGACATTGCT 
AAAGACTGCATTATTTTACTACTGTAATTTCTCCACGACATAGCATTATGTACATAGATGAG 
TGTAACATTTATATCTCACATAGAGACATGCTTATATGGTTTTATTTAAAATGAAATGCCAG 
TCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAAATCATGGATAGGGTTG 
AAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCATTTATAATGA 
AGATTAAAATGAAGGCTTTAATCAGCATTGTAAAGGAAATTGAATGGCTTTCTGATATGCTG 

TTCTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCA 

AACTGCTTTTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATG 

GTTTTAGGAAAGTGAAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAA 

GAAATCATATATGTATGGATATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATC 

AATATAAATAAAAGAGCAGAAAAATATGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACA 

ACAAAAAAAGTTGTCCTTTGAGAACTTCACCTGCTCCTATGTGGGTACCTGAGTCAAAATTG 

TCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTACCATTTCTGTTTAGTTTTACTAAA 

ATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAAACTGACAATCCAATTTGA 

AAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTTTATACATTTATA 
TTAATAAATTGTACATTTTTCTAATT 
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FIGURE 188 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73735 
xsubunit 1 of 1, 225 aa, 1 stop 
><MW: 24845, pi: 9.07, NX(S/T): 0 

MATHALE I AGLFLGGVGMVGTVAVTVMPQWRVSAF I ENNIWFENFWEGLWMNCVRQANIRM 
QCKIYDSLI^LSPDLQAARGLMCAASVMSFLAFMMAILGMKCTRCTGDNEKVKAHILLTAGI 
I F 1 1 TGMWL I P VS WVANAI I RDFYNS I VNVAQKRELGEALYLGWTTALVL I VGGALF CC VF 
CCNEKSSS YRYS I PSHRTTQKSYHTGKKSPS VYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains : 

amino acids 82-101, 118-145, 164-188 
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FIGURE 139 



TCGCCATGGCCTCTGCCGGAATGCAGATCCTGGGAGTCGTCCTGACACTGCTGGGCTGGGTG 
AATGGCCTGGTCTCCTGTGCCCTGCCCATGTGGAAGGTGACCGCTTTCATCGGCAACAGCAT 
CGTGGTGGCCCAGGTGGTGTGGGAGGGCCTGTGGATGTCCTGCGTGGTGCAGAGCACCGGCC 
AGATGCAGTGCAAGGTGTACGACTCACTGCTGGCGCTGCCACAGGACCTGCAGGCTGCACGT 
GCCCTCTGTGTCATCGCCCTCCTTGTGGCCCTGTTCGGCTTGCTGGTCTACCTTGCTGGGGC 
CAAGTGTACCACCTGTGTGGAGGAGAAGGATTCCAAGGCCCGCCTGGTGCTCACCTCTGGGA 
TTGTCTTTGTCATCTCAGGGGTCCTGACGCTAATCCCCGTGTGCTGGACGGCGCATGCCATC 
ATCCGGGACTTCTATAACCCCCTGGTGGCTGAGGCCCAAAAGCGGGAGCTGGGGGCCTCCCT 
CTACTTGGGCTGGGCGGCCTCAGGCCTTTTGTTGCTGGGTGGGGGGTTGCTGTGCTGCACTT 
GCCCCTCGGGGGGGTCCCAGGGCCCCAGCCATTACATGGCCCGCTACTCAACATCTGCCCCT 
GCCATCTCTCGGGGGCCCTCTGAGTACCCTACCAAGAATTACGTCTGACGTGGAGGGGAATG 
GGGGCTCCGCTGGCGCTAGAGCCATCCAGAAGTGGCAGTGCCCAACAGCTTTGGGATGGGTT 
CGTACCTTTTGTTTCTGCCTCCTGCTATTTTTCTTTTGACTGAGGATATTTAAAATTCATTT 
GAAAACTGAGCCAAGGTGTTGACTCAGACTCTCACTTAGGCTCTGCTGTTTCTCACCCTTGG 
ATGATGGAGCCAAAGAGGGGATGCTTTGAGATTCTGGATCTTGACATGCCCATCTTAGAAGC 
CAGTCAAGCTATGGAACTAATGCGGAGG CTGCTTGCTGTG CTGG CT TTG CAACAAGACAGAC 
TGTCCCCAAGAGTTCCTGCTGCTGCTGGGGGCTGGGCTTCCCTAGATGTCACTGGACAGCTG 
CCCCCCATCCTACTCAGGTCTCTGGAGCTCCTCTCTTCACCCCTGGAAAAACAAATCATCTG 
TTAACAAAGGACTGCCCACCTCCGGAACTTCTGACCTCTGTTTCCTCCGTCCTGATAAGACG 
TCCACCCCCCAGGGCCAGGTCCCAGCTATGTAGACCCCCGCCCCCACCTCCAACACTGCACC 
CTTCTGCCCTGCCCCCCTCGTCTCACCCCCTTTACACTCACATTTTTATCAAATAAAGCATG 
TTTTGTTAGTGCA 
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FIGURE 190 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73736 
xsubunit 1 of 1, 220 aa, 1 stop 
><MW: 23292, pi: 8.43, NX(S/T): 0 

MAS AGMQ I LG WLTLLGWVNGL VS CALPMWKVTAF I GNS I WAQ WWEGLWMS C WQSTGQM 
QCKVYDSLLALPQDLQAARALCVIALLV^ 

FVISGVLTLIPVCWTAHAIIRDFYNPLVAEAQKRELGASLYLGWAASGLLLLGGGLLCCTCP 
SGGSQGPSHYMARYSTSAPAI SRGPSEYPTKNYV 

Transmembrane domains : 

amino acids 8-30 (type II), 82-102, 121-140, 166-186 
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FIGURE 191 



G C CAAGGAGAACATCATCAAAGACTTCT CT AG ACTCAAAAGG CTT C CA CGTT CTACAT CTTG 
AGCATCTTCTACCACTCCGAATTGAACCAGTCTTCAAAGTAAAGGCAATGGCATTTTATCCC 
TTGCAAATTGCTGGGCTGGTTCTTGGGTTCCTTGGCATGGTGGGGACTCTTGCCACAACCCT 
TCTGCCTCAGTGGTGGAGTATCAGCTTTTGTTGGCAGCAACATTATTGTCTTTGAGAGGCTC 
TGGGAAGGGCTCTGGATGAATTGCATCCGACAAGCCAGGGTCCGGTTGCAATGCAAGTTCTA 
TAGCTCCTTGTTGGCTCTCCCGCCTGCCCTGGAAACAGCCCGGGCCCTCATGTGTGTGGCTG 
TTGCTCTCTCCTTGATCGCCCTGCTTATTGGCATCTGTGGCATGAAGCAGGTCCAGTGCACA 
GGCTCTAACGAGAGGGCCAAAGCATACCTTCTGGGAACTTCAGGAGTCCTCTTCATCCTGAC 
GGGTATCTTCGTTCTGATTCCGGTGAGCTGGACAGCCAATATAATCATCAGAGATTTCTACA 
AC C C AG CC AT CC AC AT AGGT CAGAAACGAGAGCTGGGAGCAGCACTTTTC CTTGGCTGGG CA 
AGCGCTGCTGTCCTCTTCATTGGAGGGGGTCTGCTTTGTGGATTTTGCTGCTGCAACAGAAA 
GAAG CAAGGGTACAGATATCCAGTGCCTGGCTACCGTGTG CC ACAC ACAG AT AAG CGAAGAA 
ATACGACAATGCTTAGTAAGACCTCCACCAGTTATGTCTAATGCCTCCTTTTGGCTCCAAGT 
ATGGACTATGGT CAATGTTTTTTATAAAGTC CTGCTAGAAACTGTAAGTATG TG AGGCAGGA 
GAACTTGCTTTATGTCTAGATTTACATTGATACGAAAGTTTCAATTTGTTACTGGTGGTAGG 
AATGAAAATGACTTACTTGGACATTCTGAC TT CAGGTGTATTAAATG C ATTGACTATTGTTG 
GACCCAAT CGCTGCTCCAATTTTC AT AT TCTAAATTCAAGTATAC C CATAATCATTAGCAAG 
TGTACAATGATGGACTACTTATTACTTTTTGACCATCATGTATTATCTGATAAGAATCTAAA 
GTTGAAATTGATATT CTATAACAATAAAACATATACCTATT CTA 
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FIGURE 192 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73737 
xsubunit 1 of 1, 173 aa, 1 stop 
><MW: 18938, pi: 9.99, NX (S/T) : 1 

MNC I RQARVRLQCKF YS S LLAL P P ALETARALMCVAVALS L I ALL I G I CGMKQVQCTGSNER 
AKAYLLGT SGVLF I LTG I FVL I PVSWTAN III RDFYNP A I H I GQKRELGAALFLGWASAAVL 
FIGGGLLCGFCCCNRKKQGYRYPVPGYRVPHTDKRRNTTMLSKTSTSYV 

Important features of the protein: 
Transmembrane domains : 

amino acids 31-51, 71-90, 112-133 

N-glycosylation site. 

amino acids 161-164 
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FIGURE 193 



AGTGACAATCTCAGAGCAGCTTCTACACCACAGCCATTTCCAGCATGAAGATCACTGGGGGT 
CTCCTTCTGCTCTGTACAGTGGTCTATTTCTGTAGCAGCTCAGAAGCTGCTAGTCTGTCTCC 
AAAAAAAGTGGACTGCAGCATTTACAAGAAGTATCCAGTGGTGGCCATCCCCTGCCCCATCA 
CATACCTACCAGTTTGTGGTTCTGACTACATCACCTATGGGAATGAATGTCACTTGTGTACC 
GAGAGCTTGAAAAGTAATGGAAGAGTTCAGTTTCTTCACGATGGAAGTTGCTAAATTCTCCA 
TGGACATAGAGAGAAAGGAATGATATTCTCAT CATC AT CTTCAT CATCCCAGGCTCTGACTG 
AGTTTCTTTCAGTTTTACTGATGTTCTGGGTGGGGGACAGAGCCAGATTCAGAGTAATCTTG 
ACTGAATGGAGAAAGTTTCTGTGCTACCCCTACAAACCCATGCCTCACTGACAGACCAGCAT 
TTTTTTTTTAACACGTCAATAAAAAAATAATCTCC CAGA 
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FIGURE 194 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73739 
xsubunit 1 of 1, 85 aa, 1 stop 
><MW: 9232, pi; 7.94, NX(S/T): 0 

MKITGGLLLLCTWYFCSSSEAASLSPKKVDCS I YKKYPWAI PCP ITYLPVCGSDYITYGN 
ECHLCTESLKSNGRVQFLHDGSC 

Signal peptide: 

amino acids 1-19 



/ 9 
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FIGURE 195 

CCCGCGCCCGGTTCTCCCTCGCAGCACCTCGAAGTGCGCCCCTCGCCCTCCTGCTCGCGCCC 
CGCCGCCATGGCTGCCTCCCCCGCGCGGCCTGCTGTCCTGGCCCTGACCGGGCTGGCGCTGC 
TCCTGCTCCTGTGCTGGGGCCCAGGTGGCATAAGTGGAAATAAACTCAAGCTGATGCTTCAA 
AAACGAGAAGCACCTGTTCCAACTAAGACTAAAGTGGCCGTTGATGAGAATAAAGCCAAAGA 
ATTCCTTGGCAGCCTGAAGCGCCAGAAGCGGCAGCTGTGGGACCGGACTCGGCCCGAGGTGC 
AGCAGTGGTACCAGCAGTTTCTCTACATGGGCTTTGATGAAGCGAAATTTGAAGATGACATC 
ACCTATTGGCTTAACAGAGATCGAAATGGACATGAATACTATGGCGATTACTACCAACGTCA 
CTATGATGAAGACTCTGCAATTGGTCCCCGGAGCCCCTACGGCTTTAGGCATGGAGCCAGCG 
TCAACTACGATGACTAC3EAACCATGACTTGCCACACGCTGTACAAGAAGCAAATAGCGATTC 
TCTTCATGTATCTCCTAATGCCTTACACTACTTGGTTTCTGATTTGCTCTATTTCAGCAGAT 
CTTTTCTACCTACTTTGTGTGATCAAAAAAGAAGAGTTAAAACAACACATGTAAATGCCTTT 
TGATATTTCATGGGAATGCCTCTCATTTAAAAATAGAAATAAAGCATTTTGTTAAAAAGA 



/9Z/Z7D 
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FIGURE 196 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA73742 
xsubunit 1 of 1, 148 aa, 1 stop 
><MW: 17183, pi: 8.77, NX(S/T): 0 

MAAS PARPAVLALTGLALLLLLCWGPGG I SGNKLKLMLQKREAP VPTKTKVAVDENKAKEFL 

GSLKRQKRQLWDRTRPEVQQWYQQFLYMGFDEAKFEDDITYWLNRDRNGHEYYGDYYQRHYD 
EDSAIGPRSPYGFRHGASVNYDDY 

Signal peptide: 

amino acids 1-3 0 
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FIGURE 197 



CGGCTCGAGCCCGCCCGGAAGTGCCCGAGGGGCCGCGATGGAGCTGGGGGAGCCGGGCGCTC 

GGTAGCGCGGCGGGCAAGGCAGGCGCCATGACCCTGATTGAAGGGGTGGGTGATGAGGTGAC 

CGTCCTTTTCTCGGTGCTTGCCTGCCTTCTGGTGCTGGCCCTTGCCTGGGTCTCAACGCACA 

CCGCTGAGGGCGGGGACCCACTGCCCCAGCCGTCAGGGACCCCAACGCCATCCCAGCCCAGC 

GCAGCCATGGCAGCTACCGACAGCATGAGAGGGGAGGCCCCAGGGGCAGAGACCCCCAGCCT 

GAGACACAGAGGTCAAGCTGCACAGCCAGAGCCCAGCACGGGGTTCACAGCAACACCGCCAG 

CCCCGGACTCCCCGCAGGAGCCCCTCGTGCTACGGCTGAAATTCCTCAATGATTCAGAGCAG 

GTGGCCAGGGCCTGGCCCCACGACACCATTGGCTCCTTGAAAAGGACCCAGTTTCCCGGCCG 

GGAACAGCAGGTGCGACTCATCTACCAAGGGCAGCTGCTAGGCGACGACACCCAGACCCTGG 

GCAGCCTTCACCTCCCTCCCAACTGCGTTCTCCACTGCCACGTGTCCACGAGAGTCGGTCCC 

CCAAATCCCCCCTGCCCGCCGGGGTCCGAGCCCGGCCCCTCCGGGCTGGAAATCGGCAGCCT 

GCTGCTGCCCCTGCTGCTCCTGCTGTTGCTGCTGCTCTGGTACTGCCAGATCCAGTACCGGC 

CCTTCTTTCCCCTGACCGCCACTCTGGGCCTGGCCGGCTTCACCCTGCTCCTCAGTCTCCTG 

GCCTTTGCCATGTACCGCCCGTAGTGCCTCCGCGGGCGCTTGGCAGCGTCGCCGGCCCCTCC 

GGACCTTGCTCCCCGCGCCGCGGCGGGAGCTGCTGCCTGCCCAGGCCCGCCTCTCCGGCCTG 

CCTCTTCCCGCTGCCCTGGAGCCCAGCCCTGCGCCGCAGAGGACTCCCGGGACTGGCGGAGG 

CCCCGCCCTGCGACCGCCGGGGCTCGGGGCCACCTCCCGGGGCTGCTGAACCTCAGCCCGCA 

CTGGGAGTGGGCTCCTCGGGGTCGGGCATCTGCTGTCGCTGCCTCGGCCCCGGGCAGAGCCG 

GGCCGCCCCGGGGGCCCGTCTTAGTGTTCTGCCGGAGGACCCAGCCGCCTCCAATCCCTGAC 

AGCTCCTTGGGCTGAGTTGGGGACGCCAGGTCGGTGGGAGGCTGGTGAAGGGGAGCGGGGAG 

GGGCAGAGGAGTTCCCCGGAACCCGTGCAGATTAAAGTAACTGTGAAGTTTTAAAAAAAAAA 
AAAAAAAA 
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FIGURE 198 



MTLIEGVGDEVTVLFSVLACLLVLAL 

RGEAPGAETPSLRHRGQAAQPEPSTGFTATPPAPDSPQEPLVLRLKFLNDSEQVARAWPHDT 
IGSLKRTQFPGREQQVRLIYQGQLLGDDTQTLGSLHLPPNCVLHCHVSTRVGPPNPPCPPGS 
E PGP SGLE I GS LLLPLLLLLLLLLWYCQ I Q YRP FF PLTATLGLAGFTLLLS LLAFAMYR P 

Signal peptides 

amino acids 1-31 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 199 

GAGATTGGAAACAGCCAGGTTGGAGCAGTGAGTGAGTAAGGAAACCTGGCTGCCCTCTCCAG 
ATTCCCCAGGCTCTCAGAGAAGATCAGCAGAAAGTCTGCAAGACCCTAAGAACCATCAGCCC 
TCAGCTGCACCTCCTCCCCTCCAAGGATgACAAAGGCGCTACTCATCTATTTGGTCAGCAGC 
TTTCTTGCCCTAAATCAGGCCAGCCTCATCAGTCGCTGTGACTTGGCCCAGGTGCTGCAGCT 
GGAGGACTTGGATGGGTTTGAGGGTTACTCCCTGAGTGACTGGCTGTGCCTGGCTTTTGTGG 
AAAGCAAGTTCAACATATCAAAGATAAATGAAAATGCGGATGGAAGCTTTGACTATGGCCTC 
TTCCAGATCAACAGCCACTACTGGTGCAACGATTATAAGAGTTACTCGGAAAACCTTTGCCA 
CGTAGACTGTCAAGATCTGCTGAATCCCAACCTTCTTGCAGGCATCCACTGCGCAAAAAGGA 
TTGTGTCCGGAGCACGGGGGATGAACAACTGGGTAGAATGGAGGTTGCACTGTTCAGGCCGG 
CCACTCTCCTACTGGCTGACAGGATGCCGCCTGAGATgAAACAGGGTGCGGGTGCACCGTGG 
AGTCATTCCAAGACTCCTGTCCTCACTCAGGGATTCTTCATTTCTTCTTCCTACTGCCTCCA 
CTTCATGTTATTTTCTTCCCTTCCCATTTACAACTAAAACTGACCAGAGCCCCAGGAATAAA 
TGGTTTTCTTGGCTTCCTCCTTACTCCCATCTGGACCCAGTCCCCTGGTTCCTGTCTGTTAT 
TTGTAAACTGAGGACCACAATAAAGAAATCTTTATATTTATCG 
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FIGURE 2QQ 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73746 
xsubunit 1 of 1, 148 aa, 1 stop 
><MW: 16896, pi: 6.05, NX(S/T): 1 

MTKALLIYLVSSFLALNQASLISRCDLAQVLQLEDLDGFEGYSLSDWLCLAFVESKFNISKI 
NENADGSFDYGLFQINSHYWCNDYKSYSENLCHVDCQDLLNPNLLAGIHCAKRIVSGARGMN 
NWVEWRLHCSGRPLSYWLTGCRLR 

Signal peptide: 

amino acids 1-18 
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FIGURE 201 

TCTGACCTGACTGGAAGCGTCCAAAGAGGGACGGCTGTCAGCCCTGCTTGACTGAGAACCCA 
CCAGCTCATCCCAGACACCTCATAGCAACCTATTTATACAAAGGGGGAAAGAAACACCTGAG 
C AGAATGGAATCATTATTTTTTT CCCAAGGAG AAAACC GGGGTAAAGGGAGGGAAG C AATT C 
AATTTGAAGTCC CTGTGAATGGGCTTTC AG AAGG CAATT AAAGAAAT C CACT C AGAGAGGAC 
TTGGGGTGAAACTTGGGTCCTGTGGTTTTCTGATTGTAAGTGGAAGCAGGTCTTGCACACGC 
TGTTGGCAAATGTCAGGACCAGGTTAAGTGACTGGCAGAAAAACTTCCAGGTGGAACAAGCA 
ACCCATGTTCTGCTGCAAGCTTGAAGGAGCCTGGAGCGGGAGAAAGCTAACTTGAACATGAC 
CTGTTGCATTTGGCAAGTTCTAGCAACATGCTCCTAAGGAAGCGATACAGGCACAGACCATG 
CAGACTCCAGTTCCTCCTGCTGCTCCTGATGCTGGGATGCGTCCTGATGATGGTGGCGATGT 
TGCACCCTCCCCACCACACCCTGCACCAGACTGTCACAGCCCAAGCCAGCAAGCACAGCCCT 
GAAGCCAGGTACCGCCTGGACTTTGGGGAATCCCAGGATTGGGTACTGGAAGCTGAGGATGA 
GGGTGAAGAGTACAGCCCTCTGGAGGGCCTGCCACCCTTTATCTCACTGCGGGAGGATCAGC 
TGCTGGTGGCCGTGGCCTTACCCCAGGCCAGAAGGAACCAGAGCCAGGGCAGGAGAGGTGGG 
AG CT ACCGCCTC ATCAAGCAGC CAAGGAGGCAGGATAAGGAAGCCC CAAAGAGGGACTGGGG 
GG CTGATGAG GACGGGGAGGTGTCTGAAGAAGAGGAGTTGAC CC CGTT CAGC CTGG AC CC AC 
GTGGCCTCCAGGAGGCACTCAGTGCCCGCATCCCCCTCCAGAGGGCTCTGCCCGAGGTGCGG 
CACCCACTGTGTCTGCAGCAGCACCCTCAGGACAGCCTGCCCACAGCCAGCGTCATCCTCTG 
TTTCCATGATGAGGCCTGGTCCACTCTCCTGCGGACTGTACACAGCATCCTCGACACAGTGC 
CC AGGGCCTTCCTGAAGGAGATCATCCT CGTGGACGAC CT CAGCCAGC AAGG ACAACTCAAG 
TCTGCTCTCAGCGAATATGTGGCCAGGCTGGAGGGGGTGAAGTTACTCAGGAGCAACAAGAG 
GCTGGGTGCCATCAGGGCCCGGATGCTGGGGGCCACCAGAGCCACCGGGGATGTGCTCGTCT 
TCATGGATGCCCACTGCG AGTG C CAC CCAGGCTGGCTGGAGC CC CT CCTCAGCAGAATAGCT 
GGTGACAGGAGC CGAGTGGT AT CT CCGGTG ATAGATGTGATTGACTGGAAGACTTTCCAGTA 
TTACCCCTCAAAGGACCTGCAGCGTGGGGTGTTGGACTGGAAGCTGGATTTCCACTGGGAAC 
CTTTGCCAGAGCATGTGAGGAAGGCCCTCCAGTCCCCCATAAGCCCCATCAGGAGCCCTGTG 
GTGCCCGGAGAGGTGGTGGCCATGGACAGACATTACTTCCAAAACACTGGAGCGTATGACTC 
TCTTATGTCGCTGCGAGGTGGTGAAAACCTCGAACTGTCTTTCAAGGCCTGGCTCTGTGGTG 
GCTCTGTTGAAATCCTTCCCTGCTCTCGGGTAGGACACATCTACCAAAATCAGGATTCCCAT 
TCCCCCCTCGACCAGGAGGCCACCCTGAGGAACAGGGTTCGCATTGCTGAGACCTGGCTGGG 
GTC ATT CAAAGAAACCTTCT ACAAGCATAGC CC AGAGGC CTTCT CCTTGAGCAAGG CTGAGA 
AGCCAGACTGCATGGAACGCTTGCAGCTGCAAAGGAGACTGGGTTGTCGGACATTCCACTGG 
TTTCTGGCTAATGTCTACCCTGAGCTGTACCCATCTGAACCCAGGCCCAGTTTCTCTGGAAA 
GCTCCACAACACTGGACTTGGGCTCTGTGCAGACTGCCAGGCAGAAGGGGACATCCTGGGCT 
GTCCCATGGTGTTGGCTCCTTGCAGTGACAGCCGGCAGCAACAGTACCTGCAGCACACCAGC 
AGGAAGGAGATTCACTTTGGCAGCCCACAGCACCTGTGCTTTGCTGTCAGGCAGGAGCAGGT 
GATTCTTCAGAACTGCACGGAGGAAGGCCTGGCCATCCACCAGCAGCACTGGGACTTCCAGG 
AGAATGGGATGATTGTCCACATTCTTTCTGGGAAATGCATGGAAGCTGTGGTGCAAGAAAAC 
AATAAAGATTTGTACCTGCGTCCGTGTGATGGAAAAGCCCGCCAGCAGTGGCGATTTGACCA 
GATAAATGCTGTGGATGAACGATGAATGTCAAT GT CAGAAGGAAAAG AGAATTTTGGCCATC 
AAAATCCAGCTCCAAGTGAACGTAAAGAGCTTATATATTTCATGAAGCTGATCCTTTTGTGT 
GTGTGCTCCTTGTGTTAGGAGAGAAAAAAGCTCTATGAAAGAATATAGGAAGTTTCTCCTTT 
TCACACCTTATTTCATTGACTGCTGGCTGCTTA 
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FIGURE 202 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73 760 
xsubunit 1 of 1, 63 9 aa, 1 stop 
><MW: 73063, pi: 6,84, NX(S/T): 2 

MLLRKRYRHRPCRLQFLLLLLMLGCVLMMVAMLHPPHHTLHQTVTAQASKHSPEARYRLDFG 
ESQDWVLEAEDEGEEYSPLEGLPPFISLREDQLLVAVALPQARRNQSQGRRGGSYRLIKQPR 
RQDKEAPKRDWGADEDGEVSEEEELTPFSLDPRGLQEALSARIPLQRALPEVRHPLCLQQHP 
QDSLPTASVILCFHDEAWSTLLRTVHSILDTVPRAFLKEI ILVDDLSQQGQLKSALSEYVAR 
LEGVKLLRSNKRLGAI RARMLGATRATGDVLVFMDAHCECHPGWLE PLLS RI AGDRSR WS P 
VIDVIDWKTFQYYPSKDLQRGVLDWKLDFHWEPLPEHVRKALQSPISPIRSPWPGEVVAMD 
RHYFQNTGAYDSLMSLRGGENLELSFKAWLCGGSVEILPCSRVGHIYQNQDSHSPLDQEATL 
RNRVRIAETWLGSFKETFYKHSPEAFSLSKAEKPDCMERLQLQRRLGCRTFHWFLANVYPEL 
YPSEPRPSFSGKLHNTGLGLCADCQAEGDILGCPMVLAPCSDSRQQQYLQHTSRKEIHFGSP 

QHLCFAVRQEQVILQNCTEEGLAIHQQHWDFQENGMIVHILSGKCMEAWQENNKDLYLRPC 
DGKARQQWRFDQ INAVDER 

Signal peptide: 

amino acids 1-28 



W ° 00/12708 PCT/US99/20111 



FIGURE 203 

CGCCAAGCATGCAGTAAAGGCTGAAAATCTGGGTCACAGCTGAGGAAGACCTCAGACATSGA 

GTCCAGGATGTGGCCTGCGCTGCTGCTGTCCCACCTCCTCCCTCTCTGGCCACTGCTGTTGC 

TGCCCCTCCCACCGCCTGCTCAGGGCTCTTCATCCTCCCCTCGAACCCCACCAGCCCCAGCC 

CGCCCCCCGTGTGCCAGGGGAGGCCCCTCGGCCCCACGTCATGTGTGCGTGTGGGAGCGAGC 

ACCTCCACCAAGCCGATCTCCTCGGGTCCCAAGATCACGTCGGCAAGTCCTGCCTGGCACTG 

CACCCCCAGCCACCCCATCAGGCTTTGAGGAGGGGCCGCCCTCATCCCAATACCCCTGGGCT 

ATCGTGTGGGGTCCCACCGTGTCTCGAGAGGATGGAGGGGACCCCAACTCTGCCAATCCCGG 

ATTTCTGGACTATGGTTTTGCAGCCCCTCATGGGCTCGCAACCCCACACCCCAACTCAGACT 

CCATGCGAGGTGATGGAGATGGGCTTATCCTTGGAGAGGCACCTGCCACCCTGCGGCCATTC 

CTGTTCGGGGGCCGTGGGGAAGGTGTGGACCCCCAGCTCTATGTCACAATTACCATCTCCAT 

CATCATTGTTCTCGTGGCCACTGGCATCATCTTCAAGTTCTGCTGGGACCGCAGCCAGAAGC 

GACGCAGACCCTCAGGGCAGCAAGGTGCCCTGAGGCAGGAGGAGAGCCAGCAGCCACTGACA 

GACCTGTCCCCGGCTGGAGTCACTGTGCTGGGGGCCTTCGGGGACTCACCTACCCCCACCCC 

TGACCATGAGGAGCCCCGAGGGGGACCCCGGCCTGGGATGCCCCACCCCAAGGGGGCTCCAG 

CCTTCCAGTTGAACCGGTQAGGGCAGGGGCAATGGGATGGGAGGGCAAAGAGGGAAGGCAAC 

TTAGGTCTTCAGAGCTGGGGTGGGGGTGCCCTCTGGATGGGTAGTGAGGAGGCAGGCGTGGC 

CTCCCACAGCCCCTGGCCCTCCCAAGGGGGCTGGACCAGCTCCTCTCTGGGAGGCACCCTTC 

CTTCTCCCAGTCTCTCAGGATCTGTGTCCTATTCTCTGCTGCCCATAACTCCAACTCTGCCC 

TCTTTGGTTTTTTCTCATGCCACCTTGTCTAAGACAACTCTGCCCTCTTAACCTTGATTCCC 

CCTCTTTGTCTTGAACTTCCCCTTCTATTCTGGCCTACCCCTTGGTTCCTGACTGTGCCCTT 

TCCCTCTTCCTCTCAGGATTCCCCTGGTGAATCTGTGATGCCCCCAATGTTGGGGTGCAGCC 

AAGCAGGAGGCCAAGGGGCCGGCACAGCCCCCATCCCACTGAGGGTGGGGCAGCTGTGGGGA 

GCTGGGGCCACAGGGGCTCCTGGCTCCTGCCCCTTGCACACCACCCGGAACACTCCCCAGCC 

CCACGGGCAATCCTATCTGCTCGCCCTCCTGCAGGTGGGGGCCTCACATATCTGTGACTTCG 

GGTCCCTGTCCCCACCCTTGTGCACTCACATGAAAGCCTTGCACACTCACCTCCACCTTCAC 

AGGCCATTTGCACACGCTCCTGCACCCTCTCCCCGTCCATACCGCTCCGCTCAGCTGACTCT 

CATGTTCTCTCGTCTCACATTTGCACTCTCTCCTTCCCACATTCTGTGCTCAGCTCACTCAG 

TGGTCAGCGTTTCCTGCACACTTTACCTCTCATGTGCGTTTCCCGGCCTGATGTTGTGGTGG 

TGTGCGGCGTGCTCACTCTCTCCCTCATGAACACCCACCCACCTCGTTTCCGCAGCCCCTGC 

GTGCTGCTCCAGAGGTGGGTGGGAGGTGAGCTGGGGGCTCCTTGGGCCCTCATCGGTCATGG 

TCTCGTCCCATTCCACACCATTTGTTTCTCTGTCTCCCCATCCTACTCCAAGGATGCCGGCA 

TCACCCTGAGGGCTCCCCCTTGGGAATGGGGTAGTGAGGCCCCAGACTTCACCCCCAGCCCA 

CTGCTAAAATCTGTTTTCTGACAGATGGGTTTTGGGGAGTCGCCTGCTGCACTACATGAGAA 

AGGGACTCCCATTTGCCCTTCCCTTTCTCCTACAGTCCCTTTTGTCTTGTCTGTCCTGGCTG 

TCTGTGTGTGTGCCATTCTCTGGACTTCAGAGCCCCCTGAGCCAGTCCTCCCTTCCCAGCCT 

CCCTTTGGGCCTCCCTAACTCCACCTAGGCTGCCAGGGACCGGAGTCAGCTGGTTCAAGGCC 

ATCGGGAGCTCTGCCTCCAAGTCTACCCTTCCCTTCCCGGACTCCCTCCTGTCCCCTCCTTT 

CCTCCCTCCTTCCTTCCACTCTCCTTCCTTTTGCTTCCCTGCCCTTTCCCCCTCCTCAGGTT 

CTTCCCTCCTTCTCACTGGTTTTTCCACCTTCCTCCTTCCCTTCTTCCCTGGCTCCTAGGCT 

GTGATATATATTTTTGTATTATCTCTTTCTTCTTCTTGTGGTGATCATCTTGAATTACTGTG 

GGATGTAAGTTTCAAAATTTTCAAATAAAGCCTTTGCAAGATAA 
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FIGURE 204 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA763 93 
xsubunit 1 of 1, 243 aa, 1 stop 
><MW: 26266, pi: 8.43, NX(S/T): 1 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGV 
PGRDGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIA 
ECTFTKMRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMN 
STINIHRTSSVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-3 0 

Transmembrane domain : 

amino acids 195-217 
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FIGURE 205 



GTTAACCAGCGCAGTCCTCCGTGCGTCCCGCCCGCCGCTGCCCTCACTCCCQGCCAGGATGG 
CATCCTGTCTGGCCCTGCGCATGGCGCTGCTGCTGGTCTCCGGGGTTCTGGCCCCTGCGGTG 
CTCACAGACGATGTTCCACAGGAGCCCGTGCCCACGCTGTGGAACGAGCCGGCCGAGCTGCC 
GTCGGGAGAAGGCCCCGTGGAGAGCACCAGCCCCGGCCGGGAGCCCGTGGACACCGGTCCCC 
CAGCCCCCACCGTCGCGCCAGGACCCGAGGACAGCACCGCGCAGGAGCGGCTGGACCAGGGC 
GGCGGGTCGCTGGGGCCCGGCGCTATCGCGGCCATCGTGATCGCCGCCCTGCTGGCCACCTG 
CGTGGTGCTGGCGCTCGTGGTCGTCGCGCTGAGAAAGTTTTCTGCCTC CTGA AGCGAATAAA 
GGGGCCGCGCCCGGCCGCGGCGCGACTCGGCAAAAAAAAAAAAAAA 
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FIGURE 206 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76398 
xsubunit 1 of 1, 121 aa, 1 stop 
><MW: 12073, pi: 4.11, NX(S/T): 0 

MASCLALRMALLLVSGVLAPAVLTDDVPQEPVPTLWNEPAELPSGEGPVESTSPGREPVDTG 
PPAPTVAPGPEDSTAQERLDQGGGSLGPGAIAAIVIAALLATCVVLALVVVALRKFSAS 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain : 

amino acids 91-110 

Glycosaminoglycan attachment site. 

amino acids 44-47 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 116-119 

N-myristoylation site. 

amino acids 91-96 
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FIGURE 207 



GGCCGTTGGTTGGTGCGCGGCTGAAGGGTGTGGCGCGAGCAGCGTCGTTGGTTGGCCGGCGG 

CGGGCCGGGACGGGCAIgGCCCTGCTGCTGTGCCTGGTGTGCCTGACGGCGGCGCTGGCCCA 

CGGCTGTCTGCACTGCCACAGCAACTTCTCCAAGAAGTTCTCCTTCTACCGCCACCATGTGA 

ACTTCAAGTCCTGGTGGGTGGGCGACATCCCCGTGTCAGGGGCGCTGCTCACCGACTGGAGC 

GACGACACGATGAAGGAGCTGCACCTGGCCATCCCCGCCAAGATCACCCGGGAGAAGCTGGA 

CCAAGTGGCGACAGCAGTGTACCAGATGATGGATCAGCTGTACCAGGGGAAGATGTACTTCC 

CCGGGTATTTCCCCAACGAGCTGCGAAACATCTTCCGGGAGCAGGTGCACCTCATCCAGAAC 

GCCATCATCGAAAGGCACCTGGCACCAGGCAGCTGGGGAGGAGGGCAGCTCTCCAGGGAGGG 

ACCCAGCCTAGCACCTGAAGGATCAATGCCATCACCCCGCGGGGACCTCCCCTAAGTAGGCr 

CCAGAGGCGCTGGGAGTGTTGCCACCGCCCTCCCCTGAAGTTTGCTCCATCTCACGCTGGGG 

GTCAACCTGGGGACCCCTTCCCTCCGGGCCATGGACACACATACATGAAAACCAGGCCGCAT 

CGACTGTCAGCACCGCTGTGGCATCTTCCAGTACGAGACCATCTCCTGCAACAACTGCACAG 

ACTCGCACGTCGCCTGCTTTGGCTATAACTGCGAGTAGGGCTCAGGCATCACACCCACCCGT 

GCCAGGGCCCTACTGTCCCTGGGGTCCCAGGCTCTCCTTGGAGGGGGCTCCCCGCCTTCCAC 

CTGGCTGTCATCGGGTAGGGCGGGGCCGTGGGTTCAGGGGCGCACCACTTCCAAGCCTGTGT 

CCCACAGGTCCTCGGCGCAGTGGAAGTCAGCTGTCCAGGGCCTCCTGAACTACATAAATAAC 

TGGCACAAGTAAGTCCCCTCCTCAAACCAACACAGGCAGTGTGTGTATGTGAGCACCTCGTG 

GGTGAGTATGTGTGGGGCACAGGCTGGCTCCCTGAGCTCCCACGTCCTAGAGGGGCTCCCGA 

GGAGGTGGAACCTCAACCCAGCTCTGCGCAGGAGGCGGCTGCAGTCCTTTTCTCCCTCAAAG 

GTCTCCGACCCTCAGCTGGAGGCGGGCATCTTTCCTAAAGGGTCCCCATAGGGTCTGGTTCC 

ACCCCATCCCAGGTCTGTGGTCAGAGCCTGGGAGGGTTCCCTACGATGGTTAGGGGTGCCCC 

ATGGAGGGGCTGACTGCCCCACATTGCCTTTCAGACAGGACACGAGCATGAGGTAAGGCCGC 

CCTGACCTGGACTTCAGGGGGAGGGGGTAAAGGGAGAGAGGAGGGGGGCTAGGGGGTCCTCT 

AGATCAGTGGGGGCACTGCAGGTGGGGCTCTCCCTATACCTGGGACACCTGCTGGATGTCAC 

CTCTGCAACCACACCCATGTGGTGGTTTCATGAACAGACCACGCTCCTCTGCCTTCTCCTGG 

CCTGGGACACACAGAGCCACCCCGGCCTTGTGAGTGACCCAGAGAAGGGAGGCCTCGGGAGA 

AGGGGTGCTCGTAAGCCAACACCAGCGTGCCGCGGCCTGCACACCCTTCGGACATCCCAGGC 

ACGAGGGTGTCGTGGATGTGGCCACACATAGGACCACACGTCCCAGCTGGGAGGAGAGGCCT 

GGGGCCCCCAGGGAGGGAGGCAGGGGGTGGGGGACATGGAGAGCTGAGGCAGCCTCGTCTCC 

CCGCAGCCTGGTATCGCCAGCCTTAAGGTGTCTGGAGCCCCCACACTTGGCCAACCTGACCT 

TGGAAGATGCTGCTGAGTGTCTCAAGCAGCACTGACAGCAGCTGGGCCTGCCCCAGGGCAAC 

GTGGGGGCGGAGACTCAGCTGGACAGCCCCTGCCTGTCACTCTGGAGCTGGGCTGCTGCTGC 

CTCAGGACCCCCTCTCCGACCCCGGACAGAGCTGAGCTGGCCAGGGCCAGGAGGGCGGGAGG 

GAGGGAATGGGGGTGGGCTGTGCGCAGGATCAGCGCCTGGGCAGGTCCGCAGAGCTGCGGGA 
TGTGATTAAAGTCCCTGATGTTTCTC 
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FIGURE 208 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA76399 
xsubunit 1 of 1, 157 aa, 1 stop 
><MW: 17681, pi: 7.65, NX(S/T): 1 

MALLLCLVCLTAALAHGCLHCHSNFSKKFSFYRHHVNFKSWWVGDIPVSGALLTDWSDDTMK 

ELHLAIPAKITREKLDQVATAVYQMMDQLYQGKMYFPGYFPNELRNIFREQVHLIQNAIIER 
HLAPGSWGGGQLSREGPSLAPEGSMPSPRGDLP 

Signal peptide: 

amino acids 1-15 
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FIGURE 209 

AGCAGGAGCAGGAGAGGGACAASGGAAGCTGCCCCGTCCAGGTTCATGTTCCTCTTATTTCT 

CCTCACGTGTGAGCTGGCTGCAGAAGTTGCTGCAGAAGTTGAGAAATCCTCAGATGGTCCTG 

GTGCTGCCCAGGAACCCACGTGGCTCACAGATGTCCCAGCTGCCATGGAATTCATTGCTGCC 

ACTGAGGTGGCTGTCATAGGCTTCTTCCAGGATTTAGAAATACCAGCAGTGCCCATACTCCA 

TAGCATGGTGCAAAAATTCCCAGGCGTGTCATTTGGGATCAGCACTGATTCTGAGGTTCTGA 

CACACTACAACATCACTGGGAACACCATCTGCCTCTTTCGCCTGGTAGACAATGAACAACTG 

AATTTAGAGGACGAAGACATTGAAAGCATTGATGCCACCAAATTGAGCCGTTTCATTGAGAT 

CAACAGCCTCCACATGGTGACAGAGTACAACCCTGTGACTGTGATTGGGTTATTCAACAGCG 

TAATTCAGATTCATCTCCTCCTGATAATGAACAAGGCCTCCCCAGAGTATGAAGAGAACATG 

CACAGATACCAGAAGGCAGCCAAGCTCTTCCAGGGGAAGATTCTCTTTATTCTGGTGGACAG 

TGGTATGAAAGAAAATGGGAAGGTGATATCATTTTTCAAACTAAAGGAGTCTCAACTGCCAG 

CTTTGGCAATTTACCAGACTCTAGATGACGAGTGGGATACACTGCCCACAGCAGAAGTTTCC 

GTAGAGCATGTGCAAAACTTTTGTGATGGATTCCTAAGTGGAAAATTGTTGAAAGAAAATCG 

TGAATCAGAAGGAAAGACTCCAAAGGTGGAACTCTSACTTCTCCTTGGAACTACATATGGCC 

AAGTATCTACTTTATGCAAAGTAAAAAGGCACAACTCAAATCTCAGAGACACTAAACAACAG 

GATCACTAGGCCTGCCAACCACACACACACGCACGTGCACACACGCACGCACGCGTGCACAC 

ACACACGCGCACACACACACACACACAGAGCTTCATTTCCTGTCTTAAAATCTCGTTTTCTC 

TTCTTCCTTCTTTTAAATTTCATATCCTCACTCCCTATCCAATTTCCTTCTTATCGTGCATT 

CATACTCTGTAAGCCCATCTGTAACACACCTAGATCAAGGCTTTAAGAGACTCACTGTGATG 

CCTCTATGAAAGAGAGGCATTCCTAGAGAAAGATTGTTCCAATTTGTCATTTAATATCAAGT 

TTGTATACTGCACATGACTTACACACAACATAGTTCCTGCTCTTTTAAGGTTACCTAAGGGT 

TGAAACTCTACCTTCTTTCATAAGCACATGTCCGTCTCTGACTCAGGATCAAAAACCAAAGG 

ATGGTTTTAAACACCTTTGTGAAATTGTCTTTTTGCCAGAAGTTAAAGGCTGTCTCCAAGTC 

CCTGAACTCAGCAGAAATAGACCATGTGAAAACTCCATGCTTGGTTAGCATCTCCAACTCCC 

TATGTAAATCAACAACCTGCATAATAAATAAAAGGCAATCATGTTATA 
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FIGURE 210 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76401 
xsubunit 1 of 1, 273 aa, 1 stop 
><MW: 30480, pi: 4.60, NX (S/T) : 1 

MEAAPSRFMFLLFLLTCELAAEVAAEVEKSSDGPGAAQEPTWLTDVPAAMEFIAATEVAVIG 
FFQDLE I P AVP I LH SM VQKFPGVS FG I STDS E VLTH YN I TGNT I CLFRLVDNEQLNLEDED I 

ESIDATKLSRFIEINSLHMVTEYNPVTVIGLFNSVIQIHLLLIMNKASPEYEENMHRYQKAA 

KLFQGKILFILVDSGMKENGKVISFFKLKESQLPALAIYQTLDDEWDTLPTAEVSVEHVQNF 
CDGFLSGKLLKENRESEGKTPKVEL 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 143-162 
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FIGURE 211 



GGAGAGCCGCGGCTGGGACCGGAGTGGGGAGCGCGGCGTGGAGGTGCCACCCGGCGCGGGTG 
GCGGAGAGATCAGAAGCCTCTTCCCCAAGCCGAGCCAACCTCAGCGGGGACCCGGGCTCAGG 
GACGCGGCGGCGGCGGCGGCGACTGCAGTGGCTGGACGATGGCAGCGTCCGCCGGAGCCGGG 
GCGGTGATTGCAGCCCCAGACAGCCGGCGCTGGCTGTGGTCGGTGCTGGCGGCGGCGCTTGG 
GCTCTTGACAGCTGGAGTATCAGCCTTGGAAGTATATACGCCAAAAGAAATCTTCGTGGCAA 
ATGGTACACAAGGGAAGCTGACCTGCAAGTTCAAGTCTACTAGTACGACTGGCGGGTTGACC 
TCAGTCTCCTGGAGCTTCCAGCCAGAGGGGGCCGACACTACTGTGTCGTTTTTCCACTACTC 
CCAAGGGCAAGTGTACCTTGGGAATTATCCACCATTTAAAGACAGAATCAGCTGGGCTGGAG 
ACCTTGACAAGAAAGATGCATCAATCAACATAGAAAATATGCAGTTTATACACAATGGCACC 
TATATCTGTG ATGT CAAAAACC CT C CTGACATCGTTGT C CAG CCTGGAC ACATTAGGCTCTA 
TGTCGTAGAAAAAGAGAATTTGCCTGTGTTTCCAGTTTGGGTAGTGGTGGGCATAGTTACTG 
CTGTGGTCCTAGGTCTCACTCTGCTCATCAGCATGATTCTGGCTGTCCTCTATAGAAGGAAA 
AACTCTAAAC GGGATTAC ACTGG CTG CAGT ACAT CAGAGAGTTTGTCAC CAG TT AAGC AGGC 
TCCTCGGAAGTCCCCCTCCGACACTGAGGGTCTTGTAAAGAGTCTGCCTTCTGGATCTCACC 
AGGGCC CAGT CATATATG CACAGTTAGAC CACTC CGG CGGACAT CAC AGTGACAAG ATTAAC 
AAGTC AGAGT CTGTGGTGTATG CGGAT AT CCGAAAGAATTAAGAGAAT AC CT AG AACATAT C 
CT C AG C AAGAAACAAAAC CAAACTGGACT CTCGTGCAGAAAATGTAGCCCATTAC CACATGT 
AG CCTTGG AGAC CCAGGCAAGGACAAGTACACGTGTACT CACAG AGGG AGAG AAAGATGTGT 
ACAAAGGATATGTATAAATATT CT ATTTAGTCAT CCTG ATATGAGG AG CCAGTGTTGCATGA 
TGAAAAGATGGTATGATTCTACATATGTACCCATTGTCTTGCTGTTTTTGTACTTTCTTTTC 
AGGT CATTTACAATTGGGAGATTTCAGAAACATT CCTTT C AC CAT C ATTTAG AAATGGTTTG 
CCTTAATGGAGACAATAG CAGATC CTGTAGTATTTC CAGT AGACATGGCCTTTTAATC TAAG 
GGCTTAAGACTGATTAGTCTTAGCATTTACTGTAGTTGGAGGATGGAGATGCTATGATGGAA 
GCATACCCAGGGTGGCCTTTAGCACAGTATCAGTACCATTTATTTGTCTGCCGCTTTTAAAA 
AATACCCATTGGCTATGCCACTTGAAAACAATTTGAGAAGTTTTTTTGAAGTTTTTCTCACT 
AAAATATGGGGC^TTGTTAGCCTTACATGTTGTGTAGACTTACTTTAAGTTTGCACCCTTC 
AAATGTGTCATATCAATTTCTGGATTCATAATAGCAAGATTAGCAAAGGATAAATGCCGAAG 
GTCACTTCATTCTGGACACAGTTGGATCAATACTGATTAAGTAGAAAATCCAAGCTTTGCTT 
GAGAACTTTTGTAACGTGGAGAGTAAAAAGTATCGGTTTTA 
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FIGURE 212 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76510 
xsubunit 1 of 1, 269 aa, 1 stop 
><MW: 29082, pi: 9.02, NX(S/T): 3 

MAASAGAGAVIAAPDSRRWLWSVLAAALGLLTAGVSALEVYTPKEIFVANGTQGKLTCKFKS 
TSTTGGLTSVSWSFQPEGADTTVSFFHYSQGQVYLGNYPPFKDRISWAGDLDKKDASINIEN 
MQ F I HNGT Y I CD VKNP PDIWQ PGH I RLYWE KENLP VFPVWVWG I VTAWLGLTLL I SM I 

LAVLYRRKNSKRDYTGCSTSESLSPVKQAPRKSPSDTEGLVKSLPSGSHQGPVIYAQLDHSG 
GHHSDKINKSESWYADIRKN 

Signal peptide: 

amino acids 1-3 7 

Transmembrane domain: 

amino acids 161-183 
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FIGURE 213 



GCCGGCTGTGCAGAGACGCCATGTACCGGCTCCTGTCAGCAGTGACTGCCCGGGCTGCCGCC 

CCCGGGGGCTTGGCCTCAAGCTGCGGACGACGCGGGGTCCATCAGCGCGCCGGGCTGCCGCC 

TCTCGGCCACGGCTGGGTCGGGGGCCTCGGGCTGGGGCTGGGGCTGGCGCTCGGGGTGAAGC 

TGGCAGGTGGGCTGAGGGGCGCGGCCCCGGCGCAGTCCCCCGCGGCCCCCGACCCTGAGGCG 

TCGCCTCTGGCCGAGCCGCCACAGGAGCAGTCCCTCGCCCCGTGGTCTCCGCAGACCCCGGC 

GCCGCCCTGCTCCAGGTGCTTCGCCAGAGCCATCGAGAGCAGCCGCGACCTGCTGCACAGGA 

TCAAGGATGAGGTGGGCGCACCGGGCATAGTGGTTGGAGTTTCTGTAGATGGAAAAGAAGTC 

TGGT CAGAAG GTTT AGGT TATG CTGATGTTGAG AAC CGTGTACC ATGTAAACCAGAGACAGT 

TATGCGAATTGCTAGCATCAGCAAAAGTCTCACCATGGTTGCTCTTGCCAAATTGTGGGAAG 

CAGGGAAACTGGATCTTGATATTCCAGTACAACATTATGTTCCCGAATTCCCAGAAAAAGAA 

TATGAAGGTGAAAAGGTTTCTGTCACAACAAGATTACTGATTTCCCATTTAAGTGGAATTCG 

TCATTATGAAAAGGACATAAAAAAGGTGAAAGAAGAG AAAGCTTATAAAG CCTT GAAGATG A 

TGAAAGAGAATGTTGCATTTGAGCAAGAAAAAGAAGGCAAAAGTAATGAAAAGAATGATTTT 

ACTAAATTTAAAACAGAGCAGGAGAATGAAGC CAAATG CCGG AATT CAAAAC CTGGCAAGAA 

AAAGAATGATTTTGAACAAGGCGAATTATATTTGAGAGAAAAGTTTGAAAATTCAATTGAAT 

CCCTAAGATTATTTAAAAATGATCCTTTGTTCTTCAAACCTGGTAGTCAGTTTTTGTATTCA 

ACTTTTGG CT AT AC CCTACTGG CAGCCATAGT AG AGAGAG CTTC AGGATGTAAATATTTGGA 

CTAT ATGC AGAAAATATT CC ATGACTTGGATATG CTGACGACTGTGCAGGAAGAAAACGAGC 

CAGTGATTTACAATAG AG CAAGGTAAATGAAT AC CTTC TGCTGTGTCTAGCTATATCG CATC 

TTAACACTATTTTATTAATTAAAAGTCAAATTTTCTTTGTTTCCATTCCAAAATCAACCTGC 

CACATTTTGGGAGCTTTTCTACATGTCTGTTTTCTCATCTGTAAAGTGAAGGAAGTAAAACA' 
TGTTTATAAAGTAAAAAAA 
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FIGURE 214 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76522 
xsubunit 1 of 1, 373 aa, 1 stop 
><MW: 41221, pi: 8.54, NX(S/T): 0 

MYRLLSAVTARAAAPGGLASSCGRRGVHQRAGLPPLGHGWVGGLGLGLGLALGVKLAGGLRG 
AAPAQSPAAPDPEASPLAEPPQEQSLAPWSPQTPAPPCSRCFARAIESSRDLLHRIKDEVGA 
PGIWGVSVDGKEVWSEGLGYADVENRVPCKPETVMRIASISKSLTMVALAKLWEAGKLDLD 
I PVQHYVPEFPEKEYEGEKVSVTTRLLI SHLSGIRHYEKDIKKVKEEKAYKALKMMKENVAF 
EQEKEGKSNEKNDFTKFKTEQENEAKCRNSKPGKKKNDFEQGELYLREKFENSIESLRLFKN 
D PL FFKPGS QFLYS TFG YTLLAAI VE RAS GC K YLD YMQ K I FHDLDMLTTVQE ENE P VI YNRAR 

Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 39-60 
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FIGURE 215 

GTGACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGT7VAGCTCGGAATTCGGCTCG 

AGGCTGGTGGGAAGAAGCCGAGATGGCGGCAGCCAGCGCTGGGGCAACCCGGCTGCTCCTGC 

TCTTGCTGATGGCGGTAGCAGCGCCCAGTCGAGCCCGGGGCAGCGGCTGCCGGGCCGGGACT 

GGTGCGCGAGGGGCTGGGGCGGAAGGTCGAGAGGGCGAGGCCTGTGGCACGGTGGGGCTGCT 

GCTGGAGCACTCATTTGAGATCGATGACAGTGCCAACTTCCGGAAGCGGGGCTCACTGCTCT 

GGAACCAGCAGGATGGTACCTTGTCCCTGTCACAGCGGCAGCTCAGCGAGGAGGAGCGGGGC 

CGACTCCGGGATGTGGCAGCCCTGAATGGCCTGTACCGGGTCCGGATCCCAAGGCGACCCGG 

GGCCCTGGATGGCCTGGAAGCTGGTGGCTATGTCTCCTCCTTTGTCCCTGCGTGCTCCCTGG 

TGGAGTCGCACCTGTCGGACCAGCTGACCCTGCACGTGGATGTGGCCGGCAACGTGGTGGGC 

GTGTCGGTGGTGACGCACCCCGGGGGCTGCCGGGGCCATGAGGTGGAGGACGTGGACCTGGA 

GCTGTTCAACACCTCGGTGCAGCTGCAGCCGCCCACCACAGCCCCAGGCCCTGAGACGGCGG 

CCTTCATTGAGCGCCTGGAGATGGAACAGGCCCAGAAGGCCAAGAACCCCCAGGAGCAGAAG 

TCCTTCTTCGCCAAATACTGGATGTACATCATTCCCGTCGTCCTGTTCCTCATGATGTCAGG 

AGCGCCAGACACCGGGGGCCAGGGTGGGGGTGGGGGTGGGGGTGGTGGTGGGGGTAGTGGCC 

TTTGCTGTGTGCCACCCTCCCTGTAAGTCTATTTAAAAACATCGACGATACATTGAAATGTG 

TGAACGTTTTGAAAAGCTACAGCTTCCAGCAGCCAAAAGCAACTGTTGTTTTGGCAAGACGG 

TCCTGATGTACAAGCTTGATTGAAATTCACTGCTCACTTGATACGTTATTCAGAAACCCAAG 

GAATGGCTGTCCCCATCCTCATGTGGCTGTGTGGAGCTCAGCTGTGTTGTGTGGCAGTTTAT 
TAAACTGTCCCCCAGATCGACACGCAAAAAAAAA 
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FIGURE 216 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA76529 
xsubunit 1 of 1, 269 aa, 1 stop 
><MW: 28004, pi: 5.80, NX(S/T): 1 

MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEI 
DDSANFRKRGSLIiWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEA 
GGYVSSFVPACSLVESHLSDQLTLPIVDVAGNVVGVSWTHPGGCRGHEVEDVDLELFNTSVQ 
LQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYW^IIPVVLFLMMSGAPDTGGQ 
GGGGGGGGGGGSGLCCVPPSL 

Signal peptide: 

amino acids 1-24 

Transmembrane domain : 

amino acids 226-243 
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FIGURE 217 



GGAGCGCTGCTGGAACCCGAGCCGGAGCCGGAGCCACAGCGGGGAGGGTGGCCTGGCGGCCT 

GGAGCCGGACGTGTCCGGGGCGTCCCCGCAGACCGGGGCAGCAGGTCGTCCGGGGGCCCACC 

ATGCTGGTGACTGCCTACCTTGCTTTTGTAGGCCTCCTGGCCTCCTGCCTGGGGCTGGAACT 

GTCAAGATGCCGGGCTAAACCCCCTGGAAGGGCCTGCAGCAATCCCTCCTTCCTTCGGTTTC 

AACTGGACTTCTATCAGGTCTACTTCCTGGCCCTGGCAGCTGATTGGCTTCAGGCCCCCTAC 

CTCTATAAACTCTACCAGCATTACTACTTCCTGGAAGGTCAAATTGCCATCCTCTATGTCTG 

TGGCCTTGCCTCTACAGTCCTCTTTGGCCTAGTGGCCTCCTCCCTTGTGGATTGGCTGGGTC 

GCAAGAATTCTTGTGTCCTCTTCTCCCTGACTTACTCACTATGCTGCTTAACCAAACTCTCT 

CAAGACTACTTTGTGCTGCTAGTGGGGCGAGCACTTGGTGGGCTGTCCACAGCCCTGCTCTT 

CTCAGCCTTCGAGGCCTGGTATATCCATGAGCACGTGGAACGGCATGACTTCCCTGCTGAGT 

GGAT CC C AGCTAC CTTTG C T CGAG CTGC CTTCTGGAAC CATGTG CTGG CTGTAGTGGCAGGT 

GTGGCAGCTGAGGCTGTAGCCAGCTGGATAGGGCTGGGGCCTGTAGCGCCCTTTGTGGCTGC 

CATCCCTCTCCTGGCTCTGGCAGGGGCCTTGGCCCTTCGAAACTGGGGGGAGAACTATGACC 

GGCAGCGTGCCTTCTCAAGGACCTGTGCTGGAGGCCTGCGCTGCCTCCTGTCGGACCGCCGC 

GTGCTGCTGCTGGGCACCATACAAGCTCTATTTGAGAGTGTCATCTTCATCTTTGTCTTCCT 

CTGGACACCTGTGCTGGACCCACACGGGGCCCCTCTGGGCATTATCTTCTCCAGCTTCATGG 

CAGCCAGCCTGCTTGGCTCTTCCCTGTACCGTATCGCCACCTCCAAGAGGTACCACCTTCAG 

CCCATGCACCTGCTGTCCCTTGCTGTGCTCATCGTCGTCTTCTCTCTCTTCATGTTGACTTT 

CTCTAC CAGCCCAGGC CAGGAGAGTCCGGTGG AGTCCTTCATAGCCTTTCTACTTATTGAGT 

TGGCTTGTGGATTATACTTTCCCAGCATGAGCTTCCTACGGAGAAAGGTGATCCCTGAGACA 

GAGCAGGCTGGTGTACTCAACTGGTTCCGGGTACCTCTGCACTCACTGGCTTGCCTAGGGCT 

CCTTGTCCTCCATGACAGTGATCGAAAAACAGGCACTCGGAATATGTTCAGCATTTGCTCTG 

CTGTCATGGTGATGGCTCTGCTGGCAGTGGTGGGACTCTTCACCGTGGTAAGGCATGATGCT 

GAGCTGCGGGTACCTTCACCTACTGAGGAGCCCTATGCCCCTGAGCTGTAACCCCACTCCAG 

GACAAGATAGCTGGGACAGACTCTTGAATTCCAGCTATCCGGGATTGTACAGATCTCTCTGT 

GACTGACTTTGTGACTGTCCTGTGGTTTCTCCTGCCATTGCTTTGTGTTTGGGAGGACATGA 

TGGGGGTGATGGACTGGAAAGAAGGTGCCAAAAGTTCCCTCTGTGTTACTCCCATTTAGAAA 

ATAAACACTTTTAAATGATCAAAAAAAAAAAA 
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FIGURE 218 



MLVTAYIAFVGLLASCLGLELSRCR 

LYKLYQHYYFLEGQIAILYVCGLASTVLFGLVASSLVDWLGRKNSCVLFSLTYSLCCLTKLS 
QDYFVLLVGRALGGLS TALL FS AFEAWY I HEHVE RHD F PAEW I PAT FARAAFWNHVLAWAG 
VAAEAVASWI GLGP VAPFVAAI PLLALAGALALRNWGENYDRQRAFSRTGAGGLRCLLSDRR 
VLLLGTIQALFESVIFIFVFLWTPVLDPHGAPLGIIFSSFMAASLLGSSLYRIATSKRYHLQ 
PMHLLSLAVLIWFSLFMLTFSTSPGQESPVESFIAFLLIELACGLYFPSMSFLRRKVIPET 
EQAGVLNWFRVPLHSLACLGLLVLHDSDRKTGTRNMFS I CS AVMVMALLAWGLFTWRHDA 
ELRVPSPTEEPYAPEL 

Signal peptide: 

amino acids 1-18 

Transmembrane domain : 

amino acids 41-55, 75-94, 127-143, 191-213, 249-270, 278-299, 
314-330, 343-359, 379-394, 410-430 
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FIGURE 219 



GCGACGCGCGGCGGGGCGGCGAGAGGAAACGCGGCGCCGGGCCGGGCCCGGCCCTGGAGATG 
GTCCCCGGCGCCGCGGGCTGGTGTTGTCTCGTGCTCTGGCTCCCCGCGTGCGTCGCGGCCCA 
CGGCTTCCGTATCCATGATTATTTGTACTTTCAAGTGCTGAGTCCTGGGGACATTCGATACA 
TCTTCACAGCCACACCTGCCAAGGACTTTGGTGGTATCTTTCACACAAGGTATGAGCAGATT 
CACCTTGTCCCCGCTGAACCTCCAGAGGCCTGCGGGGAACTCAGCAACGGTTTCTTCATCCA 
GGACCAGATTGCTCTGGTGGAGAGGGGGGGCTGCTCCTTCCTCTCCAAGACTCGGGTGGTCC 
AGGAGCACGGCGGGCGGGCGGTGATCATCTCTGACAACGCAGTTGACAATGACAGCTTCTAC 
GTGGAGATGATCCAGGACAGTACCCAGCGCACAGCTGACATCCCCGCCCTCTTCCTGCTCGG 
CCGAGACGGCTACATGATCCGCCGCTCTCTGGAACAGCATGGGCTGCCATGGGCCATCATTT 
CCATCCCAGTCAATGTCACCAGCATCCCCACCTTTGAGCTGCTGCAACCGCCCTGGACCTTC 
TGGTAGAAGAGTTTGT CC CACATT CC AGC CATAAGTGACTCTG AGCTGGG AAGGGGAAAC CC 
AGGAATTTTG CT ACTTGGAATTTGGAGATAGCAT CTGGGGACAAGTGG AGC CAGGT AG AGGA 
AAAGGGTTTGGGCGTTGCTAGGCTGAAAGGGAAGCCACACCACTGGCCTTCCCTTCCCCAGG 
GC CC CC AAGGGTGTCT CATGCTACAAGAAGAGGCAAGAGACAGG C C C C AGGG CTTCTGGCTA 
GAACCCGAAACAAAAGGAGCTGAAGGCAGGTGGCCTGAGAGCCATCTGTGACCTGTCACACT 
CACCTGGCTCCAGCCTCCCCTACCCAGGGTCTCTGCACAGTGACCTTCACAGCAGTTGTTGG 
AGTGGTTTAAAGAGCTGGTGTTTGGGGACTCAATAAACCCTCACTGACTTTTTAGCAATAAA 
GCTTCTCATCAGGGTTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 220 



></usr/seqdb2/sst /DNA/Dnaseqs . min/ss . DNA76532 
xsubunit 1 of 1, 18 8 aa, 1 stop 
><MW: 21042, pi: 5.36, NX(S/T): 2 

MVPGAAGWCCLVLWLPACVAAHGFRI HD YLYFQVLS PGD I RY I FTATP AKDFGG I FHTRYEQ 
IHLVPAEPPEACGELSNGFFIQDQIALVERGGCSFLSKTRWQEHGGRAVIISDNAVDNDSF 
YVEMIQDSTQRTADIPALFLLGRIX5YMIRRSLEQHGLPWAIISIPVNVTSIPTFELLQPPWTFW 

Signal peptide: 

amino acids 1-20 




WO 00/12708 PCT/US99/2011I 

FIGURE 221 

TCTGCCTCCACTGCTCTGTGCTGGGATCATTSGAACTTGCACTGCTGTGTGGGCTGGTGGTGA 
TGGCTGGTGTGATTCCAATCCAGGGCGGGATCCTGAACCTGAACAAGATGGTCAAGCAAGTG 
ACTGGGAAAATGCCCATCCTCTCCTACTGGCCCTACGGCTGTCACTGCGGACTAGGTGGCAG 
AGGCCAACCCAAAGATGCCACGGACTGGTGCTGCCAGACCCATGACTGCTGCTATGACCACC 
TG AAGACC CAGGGGTG CG G CAT CT ACAAGG ACAACAACAAAAGC AG CATACATTGTATGGAT 

TTATCTCAACGCTATTGTTTAATGGCTGTGTTTAATGTGATCTATCTGGAAAATGAGGACTC 
CGAATAAAAAGCTATTACTAWTTNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 222 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA7653 8 
xsubunit 1 of l, 116 aa, 1 stop 
><MW: 12910, pi: 6.41, NX(S/T): 1 

MELALLCGLWMAGVI P I QGG I LNLNKMVKQVTGKMP I LS YWP YGCHCGLGGRGQPKDATDW 
CCQTHDCCYDHLKTQGCGIYKDNNKSSIHCMDLSQRYCLMAVFNVIYLENEDSE 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain ; 

amino acids 1-24 

N-glycosylation site. 

amino acids 86-89 

N-myristoylation sites. 

amino acids 20-25, 45-50 

Phospholipase A2 histidine active site. 

amino acids 63-70 
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FIGURE 223 



CTCGCTTCTTCCTTCTGGATGGGGGCCCAGGGGGCCCAGGAGAGTATAAAGGCGATGTGGAG 
GGTGCCCGGCACAACCAGACGCCCAGTCACAGGCGAGAGCCCTGGGATGCACCGGCCAGAGG 
CCATGCTGCTGCTGCTCACGCTTGCCCTCCTGGGGGGCCCCACCTGGGCAGGGAAGATGTAT 
GGCCCTGGAGGAGGCAAGTATTTCAGCACCACTGAAGACTACGACCATGAAATCACAGGGCT 
GCGGGTGTCTGTAGGTCTTCTCCTGGTGAAAAGTGTCCAGGTGAAACTTGGAGACTCCTGGG 
ACGTGAAACTGGGAGCCTTAGGTGGGAATACCCAGGAAGTCACCCTGCAGCCAGGCGAATAC 
ATCACAAAAGTCTTTGTCGCCTTCCAAGCTTTCCTCCGGGGTATGGTCATGTACACCAGCAA 
GGACCGCTATTTCTATTTTGGGAAGCTTGATGGCCAGATCTCCTCTGCCTACCCCAGCCAAG 
AGGGGCAGGTGCTGGTGGGCATCTATGGCCAGTATCAACTCCTTGGCATCAAGAGCATTGGC 
TTTGAATGGAATTATCCACTAGAGGAGCCGACCACTGAGCCACCAGTTAATCTCACATACTC 
AGCAAACTCACCCGTGGGTCGCTAGGGTGGGGTATGGGGCCATCCGAGCTGAGGCCATCTGT 

GTGGTGGTGGCTGATGGTACTGGAGTAACTGAGTCGGGACGCTGAATCTGAATCCACCAATA 
AATAAAG CTTCTG CAGAAAA 
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FIGURE 224 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76541 
xsubunit 1 of 1, 178 aa, 1 stop 
><MW: 19600, pi: 5.89, NX (S/T) : 1 

MHRPEAMLLLLTLALLGGPTWAGKMYGPGGGKYFSTTEDYDHEITGLRVSVGLLLVKSVQVK 
LGDS WD VKLGALGGNTQE VTLQ PGEY I TKVFVAFQAFLRGMVMYTS KDRYFYFGKLDGQ I S S 
AYPSQEGQVLVGIYGQYQLLGIKSIGFEWNYPLEEPTTEPPVNLTYSANSPVGR 

Signal peptide: 

amino acids 1-22 



^ J2 7/ 2 7<f> 
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FIGURE 225 

GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCT 

GAACTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAG 

CCCCAAATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTT 

TTTTTTTTAACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATAT 

CCATGAAGATCCTATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATT 

TATTTGTTCTTGGAGTGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCT 

CCCAAGGGGTCCAATTTTTCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTG 

ACAGGGGCTGTCATGCAACTGGCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAA 

CAATACAAAGGATGGGTTTCAATGTAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTT 

ATAGCCCCCACTGTCTTACTGACAATGCTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTG 

TAGGTGTGAAGGCAAAATGGTATATTGTGAATCTCAGAAATTACAGGAGATACCCTCAAGTA 

TATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCTATAACAGCCTTCAAAAACTTAAGTATAAT 

CAATTTAAAGGGCTCAACCAGCTCACCTGGCTATACCTTGACCATAACCATATCAGCAATAT 

TGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGAGCTGATTCTTAGTTCCAATAGAA 

TCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATTTACGGAACTTGGATCTGTCC 

TATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTGCGGAAGCTGCTGAGTTT 

ACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCAAGACTGCCGCAACC 

TGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATGTCTTTGCTGGC 

ATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAACCTGGCCCT 

TTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGTCATAG 

GACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGATC 

GAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGA 

TTCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACA 

TCAGTCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTG 

AAAAGTTTTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGG 

AGTAAATGTGATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGT 

TTGATCTGGCCAGGGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCAT 

GAGAGCAAACCCCCTTTGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGC 

TGACGCCGAGCACATCTCTTTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCG 

TGCTCGTCATCCTGCTGGTTATCTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAG 

CTGCAGCAGCGCTCCCTCATGCGAAGGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAAT 

GACTCCCAGCACCCAGGAATTTTATGTAGATTATAAACCCACCAACACGGAGACCAGCGAGA 

TGCTGCTGAATGGGACGGGACCCTGCACCTATAACAAATCGGGCTCCAGGGAGTGTGAGGTA 

TGAACCATTGTGATAAAAAGAGCTCTTAAAAGCTGGGAAATAAGTGGTGCTTTATTGAACTC 

TGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCCTTCCCTCTCCCTCTCACTTTGGTGG 

CAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACTGGTCATTTTCCTCTCATACATA 

ATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTTGAACTCCGGTTTAATATAA 

TACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTTGTTTTAAGATAAAACT 
TCTTTCATAGGTAAAAAAAAAAA 
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FIGURK 226 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA77301 
xsubunit 1 of 1, 513 aa, 1 stop 
><MW: 58266, pi: 9.84, NX (S/T) : 4 

^fIr IRLLSGSAV ^ VIAPTVLLTMLSSAERGCPKGCRCEG ^ V YCESQK^^ 
™! LRYNSLQKLKYNQFKGLNQLTWLYLDHN HISNIDENAFNGIRRLKELILSSNRISYF 
^NTFRPVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELL 
IRS ^ NVFAGMIRLKELHLEHN Q FS KLNLALFPRLVSLQNLYLQWNKISVIGQTM 
SWTWS S LQRLDLS GNE I EAF SGPS VFQCVPNLQRLNLDSNKLTF I GQE I LDS W I SLND I SLA 
bitdvo^ ^ ^® LVNWLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS I CGKSTTERFDLA 
RA k^^TFKPKLPRPKHESKPPLPPTVGATEPGPETDADAEHISFHKI IAGSVALFLSVLV^ 

gtg^cTSg^ 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 420-442 

N-glycoeylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

CAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 



Z-2 7C> 
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FIGURE 227 



AGTTCTGAGAAAGAAGGAAATAAACACAGGCACCAAACCACTATCCTAAGTTGACTGTCCTT 

TAAATAJGTCAAGAT C CAGACTTTTC AGTGTCAC CT CAG CGAT CTC AACGAT AGGGATCTTG 

TGTTTGCCGCTATTCCAGTTGGTGCTCTCGGACCTACCATGCGAAGAAGATGAAATGTGTGT 

AAATTATAATGACCAACACCCTAATGGCTGGTATATCTGGATCCTCCTGCTGCTGGTTTTGG 

TGGCAGCTCTTCTCTGTGGAGCTGTGGTCCTCTGCCTCCAGTGCTGGCTGAGGAGACCCCGA 

ATTGATTCTCACAGGCGCACCATGGCAGTTTTTGCTGTTGGAGACTTGGACTCTATTTATGG 

GACAGAAGCAGCTGTGAGTCCAACTGTTGGAATTCACCTTCAAACTCAAACCCCTGACCTAT 

ATCCTGTTCCTGCTCCATGTTTTGGCCCTTTAGGCTCCCCACCTCCATATGAAGAAATTGTA 

AAAACAACCTGATTTTAGGTGTGGATTATCAATTTAAAGTATTAACGACATCTGTAATTCCA 

AAACATCAAATTTAGGAATAGTTATTTCAGTTGTTGGAAATGTCCAGAGATCTATTCATATA 

GTCTGAGGAAGGACAATTCGACAAAAGAATGGATGTTGGAAAAAATTTTGGTCATGGAGATG 

TTTAAATAGTAAAGTAGCAGGCTTTTGATGTGTCACTGCTGTATCATACTTTTATGCTACAC 

AACCAAATTAATGCTTCTCCACTAGTATCCAAACAGGCAACAATTAGGTGCTGGAAGTAGTT 

TCCATCACATTTAGGACTCCACTGCAGTATACAGCACACCATTTTCTGCTTTAAACTCTTTC 

CTAG CATGGGGTCCATAAAAATTATTATAATTTAAC AATAGC C C AAGC CGAG AATCCAAC AT 

GTCCAGAACCAGAACCAGAAAGATAGTATTTGAATGAAGGTGAGGGGAGAGAGTAGGAAAAA 

GAAAAGTTTGGAGTTGAAGGGTAAAGGATAAATGAAGAGGAAAAGGAAAAGATTACAAGTCT 

CAGCAAAAACAAGAGGTTTTATGCCCCAACCTGAAGAGGAAGAAATTGTAGATAGAAGGTGA 

AGGAGATTGCTGAAGATATAGAGCACATATAATGCCAACACGGGGAGAAAAGAAAATTTCCC 

CTTTTACAGTAATGAATGTGGCCTCCATAGTCCATAGTGTTTCTCTGGAGCCTCAGGGCTTG 

GCATTTATTGCAGCATCATGCTAAGAACCTTCGGCATAGGTATCTGTTCCCATGAGGACTGC 

AGAAGTAGCAATGAGACATCTTCAAGTGGCATTTTGGCAGTGGCCATCAGCAGGGGGACAGA 

CAAAAACATCCATCACAGATGACATATGATCTTCAGCTGACAAATTTGTTGAACAAAACAAT 
AAACATCAATAGATAT CTAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA77303 
xsubunit 1 of 1, 146 aa, 1 stop 
><MW: 16116, pi: 4.99, NX(S/T): 0 

MSRSRLFSVTSAISTIGILCLPLFQLVLSDLPCEEDEMCVNYNDQHPNGWYIWILLLLVLVA 

ALLCGAWLCLQCWLRRPRIDSHRRTMAVFAVGDLDSIYGTEAAVSPTVGIHLQTQTPDLYP 
VPAPCFGPLGSPPPYEEIVKTT 

Signal peptide: 

amino acids 1-2 9 

Transmembrane domain : 

amino acids 52-70 



JZ3/ /2 70 
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FIGURE 229 

GAGCGGAGTAAAATCTCCACAAGCTGGGAACAAACCTCGTCCCAACTCCCACCCACCGGCGT 
TTCTCCAGCTCGATCTGGAGGCTGCTTCGCCAGTGTGGGACGCAGCTGACGCCCGCTTATTA 
GCTCTCGCTGCGTCGCCCCGGCTCAGAAGCTCCGTGGCGGCGGCGACCGTGACGAGAAGCCC 
ACGGCCAGCTCAGTTCTCTTCTACTTTGGGAGAGAGAGAAAGTCAGATGCCCCTTTTAAACT 
CCCTCTTCAAAACTCATCTCCTGGGTGACTGAGTTAATAGAGTGGATACAACCTTGCTGAAG 
ATGAAGAATATACAATATTGAGGATATTTTTTTCTTTTTTTTTTCAAGTCTTGATTTGTGGC 
TTACCTCAAGTTACCATTTTTCAGTCAAGTCTGTTTGTTTGCTTCTTCAGAAATGTTTTTTA 
CAATCTCAAGAAAAAATATGTCCCAGAAATTGAGTTTACTGTTGCTTGTATTTGGACTCATT 
TGGGGATTGATGTTACTGCACTATACTTTTCAACAACCAAGACATCAAAGCAGTGTCAAGTT 
ACGTGAG CAAAT ACTAGACTTAAG CAAAAG ATATGT TAAAGCT CTAG CAGAGGAAAATAAGA 

ACACAGTGGATGTCGAGAACGGTGCTTCTATGGCAGGATATGCGGATCTGAAAAGAACAATT 
GCTGTCCTTCTGGATGACATTTTGCAACGATTGGTGAAGCTGGAGAACAAAGTTGACTATAT 
TGTTGTGAATGGCTCAGCAGCCAACACCACCAATGGTACTAGTGGGAATTTGGTGCCAGTAA 
CCACAAATAAAAGAACGAATGT CT CGGGCAGTATCAGATAGCAGTTGAAAATCACCTTGTGC 
TGCTCCATCCACTGTGGATTATATCCTATGGCAGAAAAGCTTTATAATTGCTGGCTTAGGAC 
AG AG C AAT ACTT T AC AAT AAAAGCTCTAC AC ATTTTCAAGGAGTATG CTGGATT CATGGAAC 
TCTAATTCTGTACATAAAAATTTTAAAGTTATTTGTTTGCTTTCAGGCAAGTCTGTTCAATG 
CTGTACTATGTC CTTAAAGAGAATTTGGTAACTTGGTTG ATGTGGTAAG CAG ATAGGTGAGT 
TTTGTATAAATCTTTTGTGTTTGAGATCAAGCTGAAATGAAAACACTGAAAAACATGGATTC 
ATTTCTATAACACATTTATTTAAGTATATAACACGTTTTTTGGACAAGTGAAGAATGTTTAA 
TCATTCTGTCATTTGTTCTCAATAGATGTAACTGTTAGACTACGGCTATTTGAAAAAATGTG 
CTTATTGTACTATATTTTGTTATTCCAATTATGAGCAGAGAAAGGAAATATAATGTTGAAAA 
TAATGTTTTGAAATCATGACCCAAAGAATGTATTGATTTGCACT AT CCTT CAGAATAACTGA 
AGGTTAATTATTGTATATTTTTAAAAATTACACTTATAAGAGTATAATCTTGAAATGGGTAG 
CAGCCACTGTCCATTACCTATCGTAAACATTGGGGCAATTTAATAACAGCATTAAAATAGTT 
GTAAACTCTAATCTTATACTTATTGAAGAATAAAAGATATTTTTATGATGAGAGTAACAATA 
AAGTATTCATGATTTTTCACATACATGAATGTTCATTTAAAAGTTTAATCCTTTGAGTGTCT 
ATGCTATC AGGAAAG C ACATTATTTC CATATT TGGGTTAATTTTGCTTTTATTATATTGGTC 
TAGGAGGAAGGGACTTTGGAGAATGGAACTCTTGAGGACTTTAGCCAGGTGTATATAATAAA 
GGTACTTTTGTGCTGCATTAAATTGCTTGGAAAGTGTTAACATTATATTATATAAGAGTATC 
CTTTATGAAATTTTGAATTTGTATAACAGATGCATTAGATATTCATTTTATATAATGGCCAC 
TT AAAATAAG AACATTT AAAATATAAACTATGAAGATTGACTAT CTTTTC AGGAAAAAAG CT 
GTAT AT AG CACAGGGAAC C CTAATCTTGGGTAATT CTAGTATAAAACAAATTATACTTTTAT 
TTAAATTT CCCTTGTAGCAAAT CT AATTG CCACATGGTGC CCTATATTTCATAGTATTTATT 
CTCTATAGTAACTGCTTAAGTGCAGCTAGCTTCTAGATTTAGACTATATAGAATTTAGATAT 
TGTATTGTTCGTCAT TATAATATGCTAC CACATGTAG CAATAATTACAAT ATTTTATT AAAA 
TAAATATGTGAAATATTGTTTCATGAAAGACAGATT T CCAAATCTCTCTT CT CTTCTCTGTA 
CTGTCTACCTTTATGTGAAGAAATTAATTATATGCCATTGCCAGGT 
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FIGURE 230 

></usr/.seqdb2/sst/DNA/Dnaseqs . min/ss . DNA7764 8 
xsubunit 1 of l, 14 0 aa/l stop 
><MW: 15668, pi: 10,14, NX(S/T): 5 

MFFTI SRKNMSQKLSLLLLVFGL I WGLMLLHYTFQQ PRHQSS VKLREQ I LDLSKRYVKALAE 
ENKNTVDVENGASMAGYADLKRTI AVLLDD I LQRLVKLENKVDYI WNGSAANTTNGTSGNL 
VP VTTNKRTNVS G S I R 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 
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FIGURE 231 



CGCGGCCGGGCCGCCGGGGTGAGCGTGCCGAGGCGGCTGTGGCGCAGGCTTCCAGCCCCCAC 

CAESCCGTGGCCCCTGCTGCTGCTGCTGGCCGTGAGTGGGGCCCAGACAACCCGGCCATGCT 

TCCCCGGGTGCCAATGCGAGGTGGAGACCTTCGGCCTTTTCGACAGCTTCAGCCTGACTCGG 

GTGGATTGTAGCGGCCTGGGCCCCCACATCATGCCGGTGCCCATCCCTCTGGACACAGCCCA 

CTTGGACCTGTCCTCCAACCGGCTGGAGATGGTGAATGAGTCGGTGTTGGCGGGGCCGGGCT 

ACACGACGTTGGCTGGCCTGGATCTCAGCCACAACCTGCTCACCAGCATCTCACCCACTGCC 

TTCTCCCGCCTTCGCTACCTGGAGTCGCTTGACCTCAGCCACAATGGCCTGACAGCCCTGCC 

AGCCGAGAGCTTCACCAGCTCACCCCTGAGCGACGTGAACCTTAGCCACAACCAGCTCCGGG 

AGGTCTCAGTGTCTGCCTTCACGACGCACAGTCAGGGCCGGGCACTACACGTGGACCTCTCC 

CACAACCTCATTCACCGCCTCGTGCCCCACCCCACGAGGGCCGGCCTGCCTGCGCCCACCAT 

TCAGAGCCTGAACCTGGCCTGGAACCGGCTCCATGCCGTGCCCAACCTCCGAGACTTGCCCC 

TGCGCTACCTGAGCCTGGATGGGAACCCTCTAGCTGTCATTGGTCCGGGTGCCTTCGCGGGG 

CTGGGAGGCCTTACACACCTGTCTCTGGCCAGCCTGCAGAGGCTCCCTGAGCTGGCGCCCAG 

TGGCTTCCGTGAGCTACCGGGCCTGCAGGTCCTGGACCTGTCGGGCAACCCCAAGCTTAACT 

GGGCAGGAGCTGAGGTGTTTTCAGGCCTGAGCTCCCTGCAGGAGCTGGACCTTTCGGGCACC 

AACCTGGTGCCCCTGCCTGAGGCGCTGCTCCTCCACCTCCCGGCACTGCAGAGCGTCAGCGT 

GGGCCAGGATGTGCGGTGCCGGCGCCTGGTGCGGGAGGGCACCTACCCCCGGAGGCCTGGCT 

CCAGCCCCAAGGTGCCCCTGCACTGCGTAGACACCCGGGAATCTGCTGCCAGGGGCCCCACC 

ATCTTGTGACAAATGGTGTGGCCCAGGGCCACATAACAGACTGCTGTCCTGGGCTGCCTCAG 

GTCCCGAGTAACTTATGTTCAATGTGCCAACACCAGTGGGGAGCCCGCAGGCCTATGTGGCA 

GCGTCACCACAGGAGTTGTGGGCCTAGGAGAGGCTTTGGACCTGGGAGCCACACCTAGGAGC 

AAAGTCTCACCCCTTTGTCTACGTTGCTTCCCCAAACCATGAGCAGAGGGACTTCGATGCCA 

AACCAGACTCGGGTCCCCTCCTGCTTCCCTTCCCCACTTATCCCCCAAGTGCCTTCCCTCAT 

GCCTGGGCCGGCCTGACCCGCAATGGGCAGAGGGTGGGTGGGACCCCCTGCTGCAGGGCAGA 

GTTCAGGTCCACTGGGCTGAGTGTCCCCTTGGGCCCATGGCCCAGTCACTCAGGGGCGAGTT 

TCTTTTCTAACATAGCCCTTTCTTTGCCATGAGGCCATGAGGCCCGCTTCATCCTTTTCTAT 

TTCCCTAGAACCTTAATGGTAGAAGGAATTGCAAAGAATCAAGTCCACCCTTCTCATGTGAC 

agatggggaaactgaggccttgagaaggaaaaaggctaatctaagttcctgcgggcagtggc 
atgactggagcacagcctcctgcctcccagcccggacccaatgcactttcttgtctcctcta 

ATAAGCCCCACCCTCCCCGCCTGGGCTCCCCTTGCTGCCCTTGCCTGTTCCCCATTAGCACA 
GGAGTAGCAGCAGCAGGACAGGCAAGAGCCTCACAAGTGGGACTCTGGGCCTCTGACCAGCT 
GTGCGGCATGGGCTAAGTCACTCTGCCCTTCGGAGCCTCTGGAAGCTTAGGGCACATTGGTT 
CCAGCCTAGCCAGTTTCTCACCCTGGGTTGGGGTCCCCCAGCATCCAGACTGGAAACCTACC 
CATTTTCCCCTGAGCATCCTCTAGATGCTGCCCCAAGGAGTTGCTGCAGTTCTGGAGCCTCA 
TCTGGCTGGGATCTCCAAGGGGCCTCCTGGATTCAGTCCCCACTGGCCCTGAGCACGACAGC 
CCTTCTTACCCTCCCAGGAATGCCGTGAAAGGAGACAAGGTCTGCCCGACCCATGTCTATGC 
TCTACCCCCAGGGCAGCATCTCAGCTTCCGAACCCTGGGCTGTTTCCTTAGTCTTCATTTTA 
TAAAAGTTGTTGCCTTTTTAACGGAGTGTCACTTTCAACCGGCCTCCCCTACCCCTGCTGGC 
CGGGGATGGAGACATGTCATTTGTAAAAGCAGAAAAAGGTTGCATTTGTTCACTTTTGTAAT 
ATTGTCCTGGGCCTGTGTTGGGGTGTTGGGGGAAGCTGGGCATCAGTGGCCACATGGGCATC 
AGGGGCTGGCCCCACAGAGACCCCACAGGGCAGTGAGCTCTGTCTTCCCCCACCTGCCTAGC 
CCATCATCTATCTAACCGGTCCTTGATTTAATAAACACTATAAAAGGTTTAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 232 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA77652 
xsubunit 1 of 1, 353 aa, 1 stop 
><MW: 37847, pi: 6.80, NX(S/T): 2 

MPWPLLLLLAVSGAQTTRPCFPGCQCEVETFGLFDSFSLTRVDCSGLGPHIMPVPIPLDTAH 
LDLSSNRLEMVNESVLAGPGYTTLAGLDLSHNLLTS ISPTAFSRLRYLESLDLSHNGLTALP 
AESFTSSPLSDVNLSHNQLREVSVSAFTTHSQGRALHVDLSHNLIHRLVPHPTRAGLPAPTI 
QSLNLAWNRLHAVPNLRDLPLRYLSLDGNPLAVIGPGAFAGLGGLTHLSLASLQRLPELAPS 
GFRELPGLQVLDLSGNPKLNWAGAEVFSGLSSLQELDLSGTNLVPLPEALLLHLPALQSVSV 
GQDVRCRRLVREGTYPRRPGSSPKVPLHCVDTRESAARGPTIL 

Signal peptide: 

amino acids 1-16 

Transmembrane domains: 

amino acids 215-232, 287-304 
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FIGURE 233 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGG 

GGACCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCA 

AAACAAGTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCT 

GTTCCAGGCCTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGC 

CACGACCTGTGCCACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCAC 

GGGAGGCTTGGCAGTTTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCC 

TCTAGTCTTGCCTTCAGCCTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGG 

ACTGAAGACACTCAATTTGGGAAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATG 

GATTTTCTGAGATACGGGGCAGTGTGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTA 

AGGAGGACTGAGTCTTTGCAAGACACAAAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTT 

GCTAAGACTCTATCTGGACAGGGTATTTAAAAACTACCAGACCCCTGACCATTATACTCTCC 

GGAAGATCAGCAGCCTCGCCAATTCCTTTCTTACCATCAAGAAGGACCTCCGGCTCTCTCAT 

GCCCACATGACATGCCATTGTGGGGAGGAAGCAATGAAGAAATACAGCCAGATTCTGAGTCA 

CTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGCTTTGGGGGAACTAGACATTCTTC 

TGCAATGGATGGAGGAGACAGAA^ASGAGGAAAGTGATGCTGCTGCTAAGAATATTCGAGGT 

CAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAAACCACCATCTCTTTACT 

GTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTGCTTCCTTGCATGAT 

TGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTTTGTAATATCTT 

TCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTAATGTATTT 

ATTTTTTTACTTGGACATGAAACTTTAAAAAAATTCACAGATTATATTTATAACCTGACTAG 

AGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGGCT 

AGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGA 

TATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATT 

GCACATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTAT 

CTTCCAGCCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATA 
CCAAAAAAAAAAAAAAAAAAA 



# 
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FIGURE 234 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ss .DNA83500 
xsubunit 1 of 1, 261 aa, 1 stop 
><MW: 29667, pi: 8.76, NX(S/T): 0 

MRQFPKTS FD I S PEMS FS I YSLQVPAVPGLTCWALTAE PGWGQNKGATTCATNSHSDSELRP 
EIFSSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNL 
QE IRNGFS E I RGS VQAKDGNID I R I LRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTP 
DHYTLRKI SS LANS FLT I KKDLRLSHAHMTCHCGEEAMKKYSQ I LSHFEKLE PQAAWKALG 
ELDILLQWMEETE 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

CAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 235 



CCGTTATCGTCTTGCGCTACTGCTGAATCTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCG 

CTGACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGC 

CGAGCTAGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAG 

CTCTTGCTCGGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACA 

GCCCTAGGGATCATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCAT 

TTACAGACACGTAGTGTATTCTGGAGGTCGAATGGTCACATATGAAGATCTCCGAGAGGTTG 

TGTTTGGCAAAAGTGAAGATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATG 

GCTGGTGTTATTGGCCAGTTTTTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAAT 

GGAAGGAAAAAGGAAACTGGAAGGAAAACCATTGCGATTTCGTGGTGTACATCATGCATTTG 

CAAAAATCTTAGCTGAAGGAGGAATACGAGGGCTTTGGGCAGGCTGGGTACCCAATATACAA 

AGAGCAGCACTGGTGAATATGGGAGATTTAACCACTTATGATACAGTGAAACACTACTTGGT 

ATTGAATACACCACTTGAGGACAATATCATGACTCACGGTTTATCAAGTTTATGTTCTGGAC 

TGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCATCAAAAGCAGAATAATGAATCAACCA 

CGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCGACTGACTGCTTGATTCAGGCTGT 

TCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACCATCTTGGCTGAGAATGACCC 

CTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGATGAGTGGAGTCAGTCCA 
TT TTAA 
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FIGURE 236 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA77568 ' 
xsubunit 1 of 1, 323 aa, 1 stop 
><MW: 36064, pi: 9.33, NX(S/T): 1 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAAIARLGDGA 

RESAPYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEH 

YPLWKSVIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKILAEGGI 

RGLWAGWVPNIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTP 

ADVIKSRIMNQPRDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLT 
YEKIREMSGVSPF 

Transmembrane domains : 

amino acids 25-38, 130-147, 233-248 
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FIGURE 237 



CGGACGCGTGGGCGCGGGACGCCGGCAGGGTTGTGGCGCAGCAGTCTCCTTCCTGCGCGCGC 
GCCTGAAGTCGGCGTGGGCGTTTGAGGAAGCTGGGATACAGCATTTAATGAAAAATTTATGC 
TTAAGAAGTAAAAATGGCAGGCTTCCTAGATAATTTTCGTTGGCCAGAATGTGAATGTATTG 
ACTGGAGTGAGAGAAGAAATGCTGTGGCATCTGTTGTCGCAGGTATATTGTTTTTTACAGGC 
TGGTGGATAATGATTGATGCAGCTGTGGTGTATCCTAAGCCAGAACAGTTGAACCATGCCTT 
TCACACATGTGGTGTATTTTCCACATTGGCTTTCTTCATGATAAATGCTGTATCCAATGCTC 
AGGTGAGAGGTG AT AG CTATGAAAGCGG CTGT TT AGGAAG AACAGGTG CT CGAGTTTGG CTT 
TTCATTGGTTTCATGTTGATGTTTGGGTCACTTATTGCTTCCATGTGGATTCTTTTTGGTGC 
ATATGTTACCCAAAATACTGATGTTTATCCGGGACTAGCTGTGTTTTTTCAAAATGCACTTA 
TATTTTTTAGCACTCTGATCTACAAATTTGGAAGAACCGAAGAGCTATGGACCTGAGATCAC 
TTCTTAAGTCACATTTTCCTTTTGTTATATTCTGTTTGTAGATAGGTTTTTTATCTCTCAGT 
ACACATTGCCAAATGGAGTAGATTGTACATTAAATGTTTTGTTTCTTTACATTTTTATGTTC 
TGAG TT TT GAAATAGTTTTATGAAATTTCTTT ATTTTTCATTG CATAG ACTGTTAAT ATGTA 
TATAATAC AAGACTATATGAATTGGATAATGAGTAT CAGTTTTTTATT CCTG AGATTTAGAA 
CTTGATCTACTCCCTGAGCCAGGGTTACATCATCTTGTCATTTTAGAAGTAACCACTCTTGT 
CTCTCTGGCTGGGCACGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGG 
CCGATTGCTTGAGGT CAAGTGTTT GAGAC C AG C CTGGC CAAC ATGG CG AAAC CC CAT CTACT 
AAAAATAGAAAAATTAGCCAGGCATGGTGGTGGGTGCCTGTAATCCCAGCTACCTGGGAGGC 
TGAGGCAGGAGAATCGCTTGAACCCGGGGGGCAGAGGTTGCAGTGAGCTGAGTTTGCGCCAC 
TGCACTCTAGCCTGGGGGAGAAAGTGAAACTCCCTCTCAAAAAAAAGACCACTCTCAGTATC 
TCTGATTTCTGAAGATGTACAAAAAAATATAGCTTCATATATCTGGAATGAGCACTGAGCCA 
TAAAAGGTTTTCAGCAAGTTGTAACTTATTTTGGCCTAAAAATGAGGTTTTTTTGGTAAAGA 
AAAAATATTTGTTCTTATGTATTGAAGAAGTGTACTTTTATATAATGATTTTTTAAATGCCC 
AAAGGACTAGTTTGAAAGCTTCTTTTAAAAAGAATTCCTCTAATATGACTTTATGTGAGAA 
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FIGURE 238 



MAGFLDNFRWPECECIDWSERRNAVASWAGILFFTGWWIMIDAAWYPKPEQLNHAFHTCG 
VFSTLAFFMINAVSNAQVRGDSYESGCLGRTGARVWLFIGFMLMFGSLIASMWILFGAYVTQ 
NTDVYPGLAVFFQNALIFFSTLIYKFGRTEELWT 

Important features: 
Signal peptide: 

amino acids 1-44 

Transmembrane domains: 

amino acids 23-42 (type II), 60-80, 97-117, 128-148 
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FIGURE 239 

GTTGATGGCAAACTTCCTCAAAGGAGGGGCAGAGCCTGCGCAGGGCAGGAGCAGCTGGCCCA 

CTGGCGGCCCGCAACACTCCGTCTCACCCTCTGGGCCCACTGCATCTAGAGGAGGGCCGTCT 

GTGAGGCCACTACCCCTCCAGCAACTGGGAGGTGGGACTGTCAGAAGCTGGCCCAGGGTGGT 

GGTCAGCTGGGTCAGGGACCTACGGCACCTGCTGGACCACCTCGCCTTCTCCATCGAAGCAG 

GGAAGTGGGAGCCTCGAGCCCTCGGGTGGAAGCTGACCCCAAGCCACCCTTCACCTGGACAG 

GATGAGAGTGTCAGGTGTGCTTCGCCTCCTGGCCCTCATCTTTGCCATAGTCACGACATGGA 

TGTTTATTCGAAGCTACATGAGCTTCAGCATGAAAACCATCCGTCTGCCACGCTGGCTGGCA 

GCCTCGCCCACCAAGGAGATCCAGGTTAAAAAGTACAAGTGTGGCCTCATCAAGCCCTGCCC 

AGCCAACTACTTTGCGTTTAAAATCTGCAGTGGGGCCGCCAACGTCGTGGGCCCTACTATGT 

GCTTTGAAGACCGCATGATCATGAGTCCTGTGAAAAACAATGTGGGCAGAGGCCTAAACATC 

GCCCTGGTGAATGGAACCACGGGAGCTGTGCTGGGACAGAAGGCATTTGACATGTACTCTGG 

AGATGTTATGCACCTAGTGAAATTCCTTAAAGAAATTCCGGGGGGTGCACTGGTGCTGGTGG 

CCTCCTACGACGATCCAGGGACCAAAATGAACGATGAAAGCAGGAAACTCTTCTCTGACTTG 

GGGAGTTCCTACGCAAAACAACTGGGCTTCCGGGACAGCTGGGTCTTCATAGGAGCCAAAGA 

CCTCAGGGGTAAAAGCCCCTTTGAGCAGTTCTTAAAGAACAGCCCAGACACAAACAAATACG 

AGGGATGGCCAGAGCTGCTGGAGATGGAGGGCTGCATGCCCCCGAAGCCATTTTAGGGTGGC 

TGTGGCTCTTCCTCAGCCAGGGGCCTGAAGAAGCTCCTGCCTGACTTAGGAGTCAGAGCCCG 

GCAGGGGCTGAGGAGGAGGAGCAGGGGGTGCTGCGTGGAAGGTGCTGCAGGTCCTTGCACGC 

TGTGTCGCGCCTCTCCTCCTCGGAAACAGAACCCTCCCACAGCACATCCTACCCGGAAGACC 

AGCCTCAGAGGGTCCTTCTGGAACCAGCTGTCTGTGGAGAGAATGGGGTGCTTTCGTCAGGG 

ACTGCTGACGGCTGGTCCTGAGGAAGGACAAACTGCCCAGACTTGAGCCCAATTAAATTTTA 

TTTTTGCTGGTTTTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE 240 

</usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA59814 
oubunit 1 of 1, 224 aa, l stop 
<MW: 24963, pi: 9.64, NX(S/T): 1 

MRVSGVLRLLAL I FAI VTTWMF I RS YMS FSMKT I RLPRWLAAS PTKE I QVKKYKCGL I KPCP 
ANYFAFKICSGAANWGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSG 

DVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKD 
LRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF 

Important features : 
Signal peptide: 

amino acids 1-15 

ATP/ GTP - binding site motif A (P-loop) . 

amino acids 184-191 

N-glycosylation site. 

amino acids 107-110 
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FIGURE 241 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGAT 

CCAGAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACA 

GCAAGAGAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTC 

TCACTCCTCCCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCC 

CTGCACCCCTTCCTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCC 

TGGCTGCAGATGGGGGTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCA 

GCCTCTTACCCTGAGTGTGGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGT 

GACATTTGACCCTGATTTGCCTGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGC 

CTTTGGACCTGCACAACAATGGCCACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTG 

GGTGGACTTCCCCGAAAATATGTAGCTGCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATC 

CCCAGGGGGGTCAGAACACCAGATCAACAGTGAAGCCACATTTGCAGAGCTCCAGATTGTAC 

ATTATGACTCTGATTCCTATGACAGCTTGAGTGAGGCTGCTGAGAGGCCTCAGGGCCTGGCT 

GTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAGAATATAGCTTATGAACACATTCTGAG 

TCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTCAGTGCCTCCCTTCAACCTAAGAG 

AGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATGGCTCGCTCACAACTCCCCCT 

TGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAGATTTCAATGGAACAGCT 

GGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAAGCTTCTGGTACAGA 

ACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCATCCAAGCAGGA 

TCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGCTGTCTCTG 

CCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAACCGAA 

AGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATACCA 

TGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTG 

GCCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAG 

GAATGGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTA 

GGAGGAAATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGG 

GAAGTTTGGGATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATA 

TACTGCGGGATCTCTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATA 
TATTTGGAAATTAAAGTTTCTGACTTT 
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FIGURE 242 



></usr/seqdb2/ sst/DNA/Dnaseqs . min/ss . DNA62812 
xsubunit 1 of 1, 337 aa, 1 stop 
><MW: 37668, pi: 6.27, NX(S/T): 1 

MLFS ALLLEVI W I LAADGGQHWTYEGPHGQDHWPAS YPE CGNNAQS P I D I QTDS VTFDPDLP 
ALQPHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQ 
INSEATFAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRH 
KDQKTSVPPFNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTL 
FSTEEE PS KLLVQNYRALQPLNQRMVFAS F I QAGSS YTTGEMLSLGVG I LVGCLCLLLAVYF 
IARKIRKKRLENRKSWFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 243 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTC 

GTGGACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGG 

ATCAACTCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTC 

CGGATCAGGGAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTG 

ATACCATTAACACAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGG 

AATGACACCTGGTACCCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAAC 

TGCACCCACATGTGTTACCAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGC 

TCAGAGGAATTGCCACAAATCTTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCAT 

CCTGCCCACCAGTCAGGCAGGGGCTAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAG 

GAGCAGGTGTAAATCCTGCCACCCAGGGAACCCCAGCAGGCCGCCTCCCAACTCeCAGTGGC 

ACAGATGACGACTTTGCAGTGACCACCCCTGCAGGCATCCAAAGGAGCACACATGCCATCGA 

GGAAGCCACCACAGAATCAGCAAATGGAATTCAGTAAGCTGTTTCAAATTTTTTCAACTAAG 

CTGCCTCGAATTTGGTGATACATGTGAATCTTTATCATTGATTATATTATGGAATAGATTGA 

GACACATTGGATAGTCTTAGAAGAAATTAATTCTTAATTTACCTGAAAATATTCTTGAAATT 

TCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAACAATAATTCAATGGATAAAT 

CTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATATTAAAACATATTTGGAAA 

ACTGGAAAAAAAAAAA ^^ 

AAAAAAAAAAAAAAAAAAA 
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FIGURE 244 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 

LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 

LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDD FAVTTPAG I QRSTHA I EEATTE SANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE 245 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCG 

GAGCCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCG 

GCAGCCGGGAGCCATSCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCC 

TGCTGCTCCTGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAG 

CAAAAGGCGCAGCTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGG 

GCCAGCAGGAGTGCCTGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTG 

GGATCCCAGGTCGGGATGGATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAG 

GAGTCCTGGACACCCAACTACAAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCT 

TGGGAAAATTGCGGAGTGTACATTTACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGT 

TCAGTGGCTCACTTCGGCTAAAATGCAGAAATGCATGCTGTCAGCGTTGGTATTTCACATTC 

AATGGAGCTGAATGTTCAGGACCTCTTCCCATTGAAGCTATAATTTATTTGGACCAAGGAAG 

CCCTGAAATGAATTCAACAATTAATATTCATCGCACTTCTTCTGTGGAAGGACTTTGTGAAG 

GAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGGTTGGCACTTGTTCAGATTACCCAAAA 

GGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATCATTATTGAAGAACTACCAAAATA 



GACATTTTAAATAAGTTTATGTATACATCTGAATGAAAAGCAAAGCTAAATATGTTTACAGA 

CCAAAGTGTGATTTCACACTGTTTTTAAATCTAGCATTATTCATTTTGCTTCAATCAAAAGT 

GGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATTCTCTCAACCTA 

TAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTAAAAAAATA 

TAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGTTAAAT 
AAAAATTATTTCCAACA 
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FIGURF 24* 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA76393 
xsubunit 1 of l, 243 aa, 1 stop 
><MW: 26266, pi: 8.43, NX(S/T): 1 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGV 
PGRDGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIA 
ECTFTKMRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMN 
STINIHRTSSVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 



Signal peptide: 

amino acids 1-30 



Transmembrane domain : 

amino acids 195-217 
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FIGURE 247 A 

PRO XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide 
sequences as determined by ALIGN-2) divided by (the total number of amino acid residues of 
the PRO polypeptide) = 

5 divided by 15 = 33.3% 
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FIGURE 247R 

PR ° XXXXXXXXXX (Length = 10 amino acids) 

Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide 
sequences as determined by ALIGN-2) divided by (the total number of amino acid residues of 
the PRO polypeptide) = 

5 divided by 10 = 50% 



• 
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FIGURE 247P 

PRO-DNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length = 16 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as 
determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic 
acid sequence) = 



6 divided by 14 = 42.9% 
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PRO-DNA 
Comparison DNA 



FIGURE 247H 



NNNNNNNNNNNN 
NNNNLLLVV 



(Length = 12 nucleotides) 

(Length = 9 nucleotides) 



% nucleic acid sequence identity = 



(the number of identically matching nucleotides between the two nucleic acid sequences as 
determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic 
acid sequence) = 



4 divided by 12 = 33.3% 



^53 /JZPd> 
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FIGURK 24SA 



i* 



* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

•^rnatch with stop is _M; stop-stop = 0; J (joker) match = 0 
#define _M -8 /♦ value of a match with a stop ♦/ 

int _day[26][26] = { 

£ a »/ A B /f n°o E / GHIJKL MNOPQRSTUVWXYZ*/ 
i* r J, \ I' °' °'- 4 ' °-- 1 '- 2 '- 1 - °'- M - °-- 2 - »• I- 0. 0,-6, 0,-3, 0}, 

* n « -2.-4.15.-5.-5.-4,-3.-3,-2. 0.-5.-6,-5,-4._M,-3.-5.-4. 0.-2. 0.-2.-8, 0, 0.-5} 
S P * J «• J' 3 '- 6 ' U 1 '" 2 ' °- °-- 4 '- 3 - 2 -- M -- 1 - 2.-1. 0. 0. 0,-2.-7. 0.-4. 2) 

* p . , { ?' 3> 4 '- 5, °" 1 '"2' °- °'- 3 --2. l.-M.-I. 2.-1. 0, 0, 0.-2.-7, 0 -4 3} 
i, £ 1 H.-5.-4.-6.-5. 9,-5.-2. I. 0.-5. 2. 0.-4. M.-5,-5.-4.-3,-3, 0-1 0 0 7 -5} 
«2! 1 ' 0 -" 3 ' 1 '°-- 5 - 5 --2.-3. 0,-2.-4,3. 0.M.-1.-1 -3 1 0 0 -I .-7 0 -5 0} 

i* * \'\''l''l^- 2 ' 1> " 3 '- 2 - 5 ' °-" 2 - 2 - 2.-2._M,2,2,2,l. 0. 0, 4,-5. 0 -1-2} 

. I L \ ?' « : n - °' °' °- °' °' °' °' °- °- °-- M - °- °- 0. 0, 0. 0, 0. 0. 0. 0. 0}. 

S L »/ ft Si* ^" 5 -" 2 ' °'" 2 - °' 5 '" 3 - °- 1 > 1. 3. 0. 0, 0,2,3. 0.4. 0 , 

S M »/ {ttl'li- 2 '- 4 -" 2 ' 2 ' °-- 3 ' 6 - 4 .-3--M.-3,2,3,3,I. 0. 2,2, 0,1,2}, 

» N ./ / { " n i'l 5 ^'" 2, °'" 3 '- 2 - 2 ' °' °- 4> 6 -2 -M.-2,l. 0,2,1. 0. 2.-4. 0.-2 -1} 

*?. KV£ft^ , KV^w*vs i1, o -- m -- m - m - m . m._m._m._ m . m . m . m >. 

i* n*i >*.'.;•'.'• 5 .- 1 '°'- 2 .0.-l.-3.-2,l,_M, 6. 0,0. 1,0, 0,1,6.0,5.0}. - - _ /■ 

£ » J, S I' «'" 5 ' 2 ' 2 '" 5 '- 1 ' 3 -" 2, °- 1 -- 2 '- 1 - X -- M - °- 4 - 1.-1--1. 0,-2,5. 0.-4 3} 
£ J" 2 ' ^'•i-* 1 - - *-- 3 - 2 -" 2 - °- 3 '- 3 ' °' °'- M . °' »• 6 - 0.-». 0.-2. 2 0.-4 0} 

< 1.-1.-1.0.0,3,2. l._M. 1,1.0,2. 1,0,1,2 0-3 0} 

« n ./ J J' a* °* °'" 3, °' °- °.- M > °'-l.-l. I- 3. 0, 0,5. 0,3. 0} 

m v J, \ « °' °* °' °- °' °' 0l °- °' °' °- °" °.- M > °> °- 0. 0. 0. 0, 0, 0, 0, 0, 0} 

* VI, S °.-2'-2.-2.-2.-l.-l,2. 4. 0,-2, 2, 2,2._M, 1,2,2,1. 0. 0. 4,6. 0,2,2} 

. V ./ ft-ft'l-?- °'- 7M °.-3.-2^.-4..M.-6,5. 2,2,5 0,6 17. 6, 0,-6}'. 

. v J, S ?' °* °* °' °' °- °' °" °- °' °. °- °- °'- M ' o. °. o. o. 0. 0, 0, 0, 0, 0 0} 

« Z V f o t J-; 5 -, 0 '' 1 ' °.- 4 -- 1 --2.-2._M,5,^,-4,3,3, 0,2, 0. 0.10,4}, 

J Z */ {0, 1,5. 2. 3,5, 0, 2,2, 0. 0,-2,-1, l._M. 0, 3. 0. 0. 0, 0,2,6, 0 4} 
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^include <stdio.h> 
^include <ctype.h> 



FIGURE 248B 



^define 


MAXJMP 


16 


^define 


MAXGAP 


24 


^define 


JMPS 


1024 


^define 


MX 


4 


^define 


DMAT 


3 


^define 


DMIS 


0 


#define 


DINSO 


8 


^define 


DINS1 


1 


Idefine 


PINSO 


8 


^define 


PINS1 


4 



struct jmp { 

short 

unsigned short 

}; 

struct diag { 
int 
long 
short 
struct jmp 

}« 



struct path { 
int 



/* max jumps in a diag */ 

/* don't continue to penalize gaps larger than this */ 
/* max jmps in an path */ 

/* save if there's at least MX- 1 bases since last jmp V 

/* value of matching bases */ 

/* penalty for mismatched bases */ 

/* penalty for a gap */ 

/* penalty per base */ 

/* penalty for a gap */ 

/* penalty per residue */ 



n [MAXJMP]; /* size of jmp (neg for dely) */ 
x [MAXJMP]; /* base no. of jmp in seq x */ 
/* limits seq to 2 A 1 6-1 */ 



score; /* score at last jmp */ 

offset; /* offset of prev block */ 

U m P; /* current jmp index */ 

JP; /* list of jmps */ 



short 
int 



S P°> /* number of leading spaces */ 

n[JMPS]; /* size of jmp (gap) */ 

xfJMPS];/* Ioc of jmp (last elem before gap) */ 



char 

char 

char 

char 

int 

int 

int 

int 

int 

int 

int 

int 

int 

long 

struct 

struct 

char 
char 



diag 
path 



*ofiie; 

*namex[2]; 

♦prog; 

*seqx[2]; 

dmax; 

dmaxO; 

dna; 

endgaps; 
gapx, gapy; 
lenO, lenl; 
ngapx t ngapy; 
smax; 
*xbm; 
offset; 
*dx; 
PP[2]; 



/* output file name */ 

/* seq names: getseqsO */ 

/* prog name for err msgs */ 

/* seqs: getseqsO */ 

/* best diag; nw() */ 

/* final diag */ 

/* set if dna: mainO */ 

/* set if penalizing end gaps */ 

/* total gaps in seqs */ 

/* seq lens */ 

/* total size of gaps */ 

/* max score: nw() */ 

/* bitmap for matching */ 

/* current offset in jmp file */ 

/* holds diagonals */ 

/* holds path for seqs */ 



♦callocO, *malloc0. *index0, *strcpy0; 
♦getseqO, *g_callocQ; 
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/* Needleman-Wunsch alignment program 

* usage: progs filel file2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 
Any lines beginning with ';\ ' > • or ' < • are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align.out" 

* The program may create a imp file in /tmp to hold info about traceback 
Original version developed under BSD 4.3 on a vax 8650 

*/ 

#include "nw.h" 
include "day.h" 

static _dbvair26] = { 

1.14,2,13,0,0.4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0 

static _pbval[26] = { 

1 , 2 1 (1< < fD'-'A*)) | (1< < CN'-'A')). 4, 8, 16, 32, 64 
128, 256, OxFFFFFFF, 1 < < 10, 1< < 11, 1< < 12, 1< < 13 1<<14 
\t <1 ^ J <<16 ' 1 <<17, 1<<18. 1<<19, 1<<20, 1<<21, 1<<22 
1 < <23, 1 < <24, 1 < <25 |(1 < <CE'--A'))|<1 < <f Q'-'A')) 

main(ac, av) 

int ac; main 
char *avQ; 

prog = av[0]; 
if(ac != 3) { 

rprintf(stderr, "usage: %s filel file2\n\ prog); 

fjprmtf(stderr, "where filel and file2 are two dna or two protein sequencesAn"); 
rpnntf(stderr, "The sequences can be in upper- or lower-case\n")- 
fprintf(stderr,"Any lines beginning with V or 1 < ' are ignored\n")- 
fpnntf(stderr. "Output is in the file \"align.out\"\n")- 
exit(l); 

} 

namexfO] = av[l] ; 

namex[l] = av[2); 

seqx[0] - getseq(namex[0], &lenO); 

seqx[lj = getseq(namex[I], &lenl); 

xbm = (dna)? .dbval : _pbval; 

endgaps - 0; /* 1 to penalize endgaps */ 

ofile = "align.out"; /* output file */ 



nwO; /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

P rmt 0; /* print stats, alignment */ 

cleanup(0); /* unlink any tmp files */ 



^/2 70 
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/* do the alignment, return best score: mainO 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
*/ 

nw() 
{ 



nw 



char 


*px, *py; 


int 


*ndely, *dely; 


int 


ndelx, delx; 


int 


*tmp; 


int 


mis; 


int 


insO, insl; 


register 


id; 


register 


u; 


register 


*colO, *coll; 


register 


xx, yy; 



/* seqs and ptrs */ 

/* keep track of dely */ 

/♦keep track of delx */ 

/* for swapping rowO, rowl */ 

/* score for each type ♦/ 

/* insertion penalties */ 

/* diagonal index ♦/ 

/* jmp index */ 

/* score for curr, last row ♦/ 

/* index into seqs */ 



dx 



(struct diag *)g_calloc( w to get diags\ lenO + lenl + 1, sizeof(struct diag)); 



ndely = (int *)g_calloc("to get ndely", lenl + 1, sizeof(int)); 
dely = (int *)g_calloc("to get dely", lenl + 1 , sizeof(int)); 
colO = (int *)g_callocCto get co!0\ lenl + 1, sizeof(int)); 
coll = (int *)g_caIlocCto get coll", lenl + 1, sizeof(int))- 
insO = (dna)? DINSO : PINS0; 
insl = (dna)? DINS1 : PINS1; 

smax = -10000; 
if (endgaps) { 

for (co!0[0] - dely[0] - -insO, yy = 1; yy < = lenl; yy+ +) { 
coI0[yy] = delylyy] = coi0[yy-l] - insl; 
ndely(yy] = yy; 

col0[0] = 0; /* Waterman Bull Math Biol 84 */ 



} 

else 



for(yy = l;yy < = l en l;yy++) 
delyfyy] = -insO; 



/* fill in match matrix 



for (px - seqx[0], xx = 1; xx < = lenO; px++, xx++) { 
/* initialize first entry in coi 
*/ 

if (endgaps) { 

if (xx ==1) 

coll[0] = delx = -(insO+insl); 

else 

coll[0] « delx = col0[0] - insl; 
ndelx = xx: 



} 

else { 



coll[0J = 0; 
delx = -insO; 
ndelx = 0; 
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for(py = seqx[l], yy = 1; yy < = lenl; py+ + , yy + +) { 
mis = coI0[yy-l]; 
if(dna) 

mis + = (xbm(*px- , A , ]&xbm(*py- , A , ])7 DMAT : DMIS; 

else 

mis + = _day[*px- t A , ][*py- , A , J; 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps 1 1 ndely[yy] < MAXGAP) { 

if (col0[yy] - ihsO > = dely[yyj) { 

dely[yy] = col0(yy] - (insO + insl); 
ndely[yy] = 1; 

}else{ 

dely[yy] -= insl; 
ndely[yy] + -f ; 

} 

} else { 

if (col0[yyj - (insO + insl) > = dely[yy]) { 
dely[yy] = coI0[yy] - (insO+insl); 
ndely[yy] = 1; 

}else 

ndely[yy] + + ; 



/* update penalty for del in y seq; 

* favor new del over ongong del 
*/ 

if (endgaps 1 1 ndelx < MAXGAP) { 

if (coll[yy-l] - insO > = delx) { 

delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

} else { 

delx -= insl; 
ndelx + + ; 

} 

}else{ 

if (coll[yy-l] - (insO+insl) > - delx) { 
delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

}else 

ndelx+ + ; 

} 

/* pick the maximum score; we're favoring 
* mis over any del and delx over dely 
*/ 
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id = xx-yy + lenl-l; mmmW " 
if (mis > = deix && mis > = dely[yy]) 

coIl[yy] = mis; 
else if (delx > = dely[yy]) { 

colljyy] = deix; 

ij «■ dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna 1 1 (ndelx > - MAXJMP 
&&xx > dxndJ.jp.x[ij] + MX) | | mis > dxfid]. score +DINS0)) { 
dx[id].ijmp++; 
if (+ + jj >= MAXJMP) { 
writejmps(id); 
ij = dxfidj.ijmp = 0; 
dx[id], offset = offset; 

offset += sizeof (struct jmp) + sizeof(offset); 

} 

dx[id].jp.n[ij] = ndelx; 
dx[idj.jp.x[ij] = xx; 
dx[idj. score = deix; 

} 

else { 

coll[yyl = dely[yy]; 
ij = dx[idl.ijmp; 

if (dx[id].jp.n(0] && (!dna 1 1 (ndely[yy] > = MAXJMP 

&&xx > dx[id].jp.x[ij]+MX) 1 1 mis > dx[id]. score +DINS0)) { 
dx[idj.ijmp+ + ; 
if (+ + ij > = MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp = 0; 
dx[idj. offset = offset; 

offset += sizeof(struct jmp) + sizeof(offset); 

} 

dx(id].jp.n[ij] « -ndely[yy] ; 
dxfid] Jp.xfij] = xx; 
dx[id].score = dely[yy] ; 

if (xx = = len0&&yy < Ienl) { 
/* last col 
*/ 

if (endgaps) 

coll[yy] -= insO+insl*(Ienl-yy); 
if (co!l[yy] > smax) { 

smax = coll[yy]; 

dmax = id; 

} 

} 

} 

if (endgaps && xx < lenO) 

coll[yy-l] -= ms0+insi*(len0-xx); 
if (coll[yy-l) > smax) { 

smax = coll[yy-l]; 

dmax = id; 

} 

^ tmp = coIO; colO - coll; coil = tmp; 

(void) free((char *)ndely); 
(void) free((char *)dely) 
(void) free((char *)col0) 
(void) free((char *)coll) 
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* printO - only routine visible outside this module 

* static: 

* getmatO - trace back best path, count matches: print() 

* pr_aitgnO - print alignment of described in array pQ: print() 

* dumpblockO - dump a block of lines with numbers, stars: pr align() 

* numsO - put out a number line: dumpblockO 

* putlineO -- put out a line (name, [num], seq, [num]): dumpblockO 

* starsO - -put a line of stars: dumpblockO 

* stripnameO - strip any path and prefix from a seqname 



^include "nw.h" 

^define SPC 3 

^define P LINE 256 

^define P SPC 3 



extern 

int 

FILE 

printO 
{ 



_day(26][26]; 

oien; 

*fx; 



/* maximum output line */ 

/* space between name or num and seq */ 



/* set output line length */ 
I* output file */ 



lx, ly, firstgap, lastgap; /* overlap */ 

fopen(ofiIe, "w")) == 0) { 

rprintf(stderr,"%s: can't write %sVn*\ prog, ofile); 

cleanup(l); 



print 



int 

if ((fx 
} 

fpnntf(rx, -<first sequence: %s (length - %d)\n\ namex[0], lenO)- 
o^en nt =^ " <SeC ° ndSeqUenCe: %S(length = %d ^ n "> nunex[l], lenl); 
lx = lenO; 
ly = lenl; 

firstgap = lastgap = 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 
pptOj.spc = firstgap = lenl - dmax - 1; 
ly PPfOJ.spc; 

> 

else if (dmax > lenl - 1) { /* leading gap in y */ 
PP[l].spc = firstgap = dmax - (lenl - 1); 
lx -= pp[l].spc; 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap - Ien0-dmax0-l; 
lx -= lastgap; 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 

lastgap = dmaxO - (lenO - 1); 
^ ly -= lastgap; 

getmat(lx, ly, firstgap, lastgap); 
pralignQ: 
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{ 



Mrace back the best path, count matches 
static 

getmatflx, ly, firstgap, lastgap) 

int lx, Jy; /* - CQre « (minus end } mf 

mt firstgap, lastgap; /* fading trailing overlap */ 

mt nm f iO, il. sizO, sizl; 

outxf32] ; 
double pet; 
register nO.nl; 
register char *p0, *pl; 

/* get total matches, score 
*/ 

iO = il _ sizO = sizl = 0; 
pO = seqx[0J + pp[l].spc; 
pi = seqx[i] + pp[0).spc; 
nO = ppflj.spc + I; 
nl = pp[0].spc + 1; 

nm = 0; 

while ( *p0 && *pl ) { 
if (sizO) { 

pi + + ; 
nl + + ; 
sizO--; 

} 

else if (sizl) { 

p0+ + ; 



} 

else { 



nO++; 
Sizl—; 



if (xbmf^pO-'A'J&xbm^pl-'A']) 

nm+ + ; 
if (nO++ pp[0].xfiO]) 

sizO = pp[0].n[iO++l; 
if(nl + + =-pp[lj.x[il]) 

sizl - pp[lJ.n[il + +J; 

p0+ + ; 
pl + + ; 



/* pet homology: 

* if penalizing endgaps, base is the shoner seq 
^else, knock off overhangs and take shorter core 

if (endgaps) 

Ix«(len0 < lenl)? lenO : lenl; 

else 

lx = (lx < ly)? lx : ly; 
pet = 100.*(doubJe)nm/(doubie)lx; 
fjprintf(fx, -\n-); 

<printf(fx "< %d matches in an overlap of %d: %.2f percent similarityVn', 
nm. (nm 1)? - : -esMx.pct); 
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rprintf(fx, " <gaps in first sequence: %d\ eapxV . . 

if(ga P x){ 8P; —getmat 

(void) sprintf(outx, ■ (%d %s%s)\ 

ngapx, (dna)? "base": "residue", (ngapx mm iy> *-.« s »y 
fprintf(fx."%s",outx); 

fprintf(fx, gaps in second sequence: %d", gapy)* 
tf(gapy){ 

(void) sprintf(outx, • (%d %s%s)\ 

ngapy, (dna)? "base": "residue", (ngapy == m •"•- S "V 

rprintf(fx,"%s",outx) ; * Py >' ' &h 

} 

if (dna) 

fprintf(fx, 

"it sss? = %d - gap - %d + %d ^ b * 

else 

fprinrf(fx, 

«NS0 % pSr fPAM ^ ^ ^ ^ = %d + %d reSid ^""- ' 
if (endgaps) 

fprintf(fx, 

' <endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n" 
firstgap, (dna)? "base" : "residue", (firstgap = = 1)? - ■ V 
lastgap, (dna)? "base" : "residue", (lastgap = = 1)? — : - s -) ; * 

fprimf(fx, " < endgaps not penalized\n"); 



} 



!***! c " m: >* matches in core - for checking */ 

™ :! C ,max; '* ^ngths of stripped file names */ 

static U[2]; /* jmp index for a path */ 

S tar n ^ 21 ' '* number al start of current line */ 

S £ c mt2 ^ /* current elem number - for gapping */ 

static siz{2j| ; 

static char * ps [2]; /* ptr to current element */ 

static char *po[ 2 ]; /* ptr to next output char slot •/ 

staUc char out[2][P LINE]; /* output line */ 

static char star[PLINE]; /* se t by starsO */ 

/* 

*^print alignment of described in struct path ppQ 

static 
pr.alignO 

{ pralign 

™ l nn; /* char count */ 

mt more; 
register i ; 

for (i = 0, lmax = 0; i < 2; i + +) { 
nn = stripname(namex{il); 
if (nn > lmax) 

lmax = nn; 

nc[i) = i; 
ni[i] = 1; 
siz[i] ij[i] » 0; 
ps[i] = seqxfi]; 
po[iJ « out[i]; 
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for (nn = nm = 0, more = 1; more; ) { 

for (i = more = 0; i < 2; i++) { ...prjlllgn 
/* 

*^do we have more of this sequence? 

if(!*ps[i]) 

continue; 

more+ + ; 

if (PPW-spc) { /* leading space */ 
*po[i] + + = ' 
pp[i].spc~; 

} 

else if (sizM) { /* in a gap */ 
*po[i] + + = 
sizfi]-; 

} 

eke t >* we're puning a seq element 

*/ 

*po[iJ = *ps[i] ; 
if (islower(*ps[il)) 

*ps[i] = toupper(*ps[i]); 

po[i] + +; 
ps[i] + + ; 

/* 

* are we at next gap for this seq? 
*/ ~m 

if(ni[i] " pp(i].x[ij[i]]) { 
/* 

* we need to merge ail gaps 

* at this location 
*/ 

siz(i) = pp[i].n[ij[i]++]; 
while (ni[i] ==ppU].x[ij[i]J) 

sizfij +=pp[i].n(!j[i]++]; 

ni[i] + + ; 

, 1 

if (++nn == olen 1 1 !more&& nn) { 
dumpblockO; 
for(i = 0; i < 2; i + +) 
pofij = out{i]; 

nn = 0; 

} 



} 



/* 

* Arnip a block of lines, including numbers, stars: pr_aiign() 
static 

dumpblockO 

{ dumpblock 

register i; 

for(i = 0; i < 2; i + +) 
•po[i]~ = 'UK; 
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(void) pmcOn\ fx); ...dumpblock 

for (i = 0; i < 2; i + +) { 

if (*out[iJ && (*out[i] ! = ' • 1 1 *(po[ij) !=")){ 
if(i==0) 

nums(i); 
if (i ===== o && *out[lJ) 
siarsO; 

putline(i); 

if (i 0&& *out{lJ) 

rprintf(fx, star); 
if(i == 1) 
^ nums(i); 



/* 



* put out a number line: dumpblockO 

static 
nums(ix) 

int i x: /*inrfPvin«n f nK^^,:_.. ( niimS 



^ mt /* index in outQ holding seq line */ 



char 
register 



nJine[P LINE]; 
«. j; 



register char * pn , *px, *py; 

for(pn = niine, i = 0; i < imax+P SPC; i+ + , pn++) 
*pn = ' 

for (i - nc(ix], py = outfix]; * py; py+ +, pn + +) { 

if (*py == ' ' || *py *_•) 

*pn = * 



else { 



} 

else 

>+ + ; 
} 

} 

*pn = '\0' ; 
nc[ix] = i; 

for (pn = niine; *pn; pn+ +) 
(void) putc(*pn, fx); 
(void) putc('\n\ fx); 



if (i%!0 «- 0 1 1 (i — i && nc[ix] != 1)) { 
j - (i < 0)?-i:i; 
for(px =pn;j;j/= 10, px~) 
*px = j%10 + '0'; 

if a < o) 



*pn - ' *; 



/* 

*^put out a line (name, [num], seq, [num]): dumpblockO 

static 
putline(ix) 

int ix; putline 

{ 
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..•outline 

int i; r 

register char *px; 

for(px - namexfix], i = 0; *px &Sc *px != V; px+ + . i++) 

(void) putc(*px, fx); 
for(; i < lmax+P_SPC; i++) 

(void) putcC \ fx); 

/* these count from 1 : 

* ni[] is current element (from 1) 

* ncQ is number at start of current line 
*/ 

for (px - out[ix]; *px; px++) 

(void) putc(*px&0x7F, fx); 
(void) putc('\n\ fx); 

} 



*^put a line of stars (seqs always in out[0] f out[l]): dumpblock() 

static 
starsO 
{ 

int i; 

register char *pO, *pl, ex. *px; 

if (!*out[0J 1 1 (*out[0] *(po[0]) == • ') 1 1 

!*out[l] 1 1 (*out[l] = ='•&& *(po[lJ) = =•')) 
return; 
px = star; 

for (i = lmax+P_SPC; i; i-) 
*px+ + = ' 

for (pO = out[0J, pi = out[lJ; *p0 && *pl; p0++, pl + +) { 
if (isalpha(*pO) && isalpba(*pl)) { 

if (xbm^pO-'AI&xbmrpl-'A']) { 
cx = 
nm+ + ; 

} 

elseif (!dna&& day^pO-'A'J^pl-'A'I > 0) 

cx = V; 

else 

cx = ' '; 

} 

else 

cx = • '; 
*px++ = cx; 

} 

*px+ + = '\n'; 
♦px = *\0'; 



stars 



# 
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/* 

*^strip path or prefix from pn, return len: pr_align() 



static 

stripname(pn) . 

^ char *pn; /• file name (may be path) */ StTipname 

register char *px, *py; 

py = 0; 

for (px = pn; *px; px+ +) 
if (*p X = = V) 

py - px + 1; 

if(py) 

(void) strcpy(pn, py); 
return(strien(pn)); 
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/* 

* cleanupO ~ cleanup any tmp file 

* getseqO - read in seq. set dna, len, maxien 

* g_callocO - callocO with error checkin 

* readjmpsO - get the good jmps, from tmp file if necessary 
^writejmpsO - write a filled array of jmps to a tmp file: nwO 

^include "nw.IT 
^include <sys/file.h> 



char *jname = Vtmp/homgXXXXXX": 
FILE *rj; 



int 



cleanupO; 
lseekO; 



/* tmp file for jmps */ 
/* cleanup tmp file */ 



* remove any tmp file if we blow 
*/ 

cleanup(i) 
{ 



int 

if(fj) 



exit(i); 



(void) uniink(jname); 



cleanup 



* read, return ptr to seq, set dna. len, maxien 

* skip lines storting with Y. ' < \ or ' > * 

* seq in upper or lower case 
*/ 

char * 

getseq(file, len) 

char *fi!e; 
int *len; 



{ 



/* file name */ 
/* seq len */ 



char 

register char 

int 

FILE 



line[1024], *pseq; 

*px, *py; 

natgc, tlen; 



if «fp = fopen(file,"r-)) == 0) { 

fprintf(stderr,"%s: can't read %s\n\ prog, file)- 
exit(l); F 

} 

tien = natgc = 0; 

while (fgets(line, 1024, fp)) { 

if(*line == •;• || *line== •<• || *Hne == •>•) 

continue; 
for (px - line; *px ! = '\n'; px+ +) 

if (isupper(*px) 1 1 islower(*px)) 
tlen+ + ; 

} 

if ((pseq = malloc((unsigned)(tlen+6))) « 0) { 

rprintf(stderr,"%s: mailocO failed to get %d bytes for %s\n\ prog, tlen+6, file); 
exit(I); 

> 

pseq[0] = pseq[l] = pseqPJ = pseq^J = -\0'; 
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} 



py = pseq + 4; ...getseq 

*len - tlen; 

rewind(fp); 

while (fgets(line, 1024, fp)) { 

if dine" || *line== •<■ || *Hne= = •>•) 
continue; 

for (px = line; *px != 'm'; px + +) { 
if (isupper(*px)) 

*Py + + = *px; 
else if (is!ower(*px)) 

*py++ = toupper(*px); 
if (index("ATGCU\*(py-l))) 

natgc+ + ; 

} 

*Py+ + = *\0'; 

*py - '\0'; 

(void) fclose(fp); 

dna = natgc > (tlen/3); 

return(pseq+4); 



char * 

g_calIoc(msg, nx t sz) 

char *msg; /* program, calling routine */ g-Calloc 

mt nx, sz; /* number and size of elements */ 

char *px, *callocO; 

if ((px = calIoc((unsigned)nx, (unsigned)sz)) = = 0W 
if(*msg){ M 

Jri«f(stderr, •%„ g ^alloc 0 failed ft, (n=%d, sz=%d)Vn", prog, msg, nx, sz); 

, 1 

^ return(px); 
/* 

*&* final jmps from dxQ or tmp file, set ppO, reset dmax: mainO 
readjmpsO 

{ readjmps 

int fd - -1; 

mt siz, K),il; 

i. j, xx; 



if(Q){ 

(void) fclose(rj); 

if ((fd - open(jname, ORDONLY, 0)) < 0) { 

rprintf(stderr, "%s: cantopenO %s\n\ prog, jnameV 
^ cleanup(l); 

for (i = i0 a il m 0, dmaxO - dmax, xx = lenO; ; i+ +) f 
while (i) { ' 1 

for(j m dx[dmax].ijmp; j > = 0 && dxfdmax].jp.x(j) >= xx; j~) 
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if (j < o && dx[dmax].offset && fj) { ...readjmps 

(void) lseek(fd, dx(dmax]. offset, 0); 

(void) read(fd, (char *)&dx[dmax] jp, sizeof(struct jmp)); 

(void) read(fd, (char *)&dx[dmax] .offset, sizeof(dx[dmax] . offset))- 

dx[dmax].ijmp = MAXJMP-1; 

else 

break; 

} 

if (i > = JMPS) { 

rprintf(stderr, "%s: too many gaps in aiignment\n\ prog)- 
cleanup(l); F 

} 

if (I > = 0){ 

siz « dxfdmax] jp.n[j]; 
xx = dxfdmax] jp.x(j]; 
dmax + = siz; 

if (siz < 0) { /* g a p in second ^ v 

PP[l].n[il] = -siz; 
xx += siz; 

/* id - xx - yy + lenl - 1 
*/ 

PPU] x[il] = xx - dmax + lenl - 1; 
gapy+ + ; 

. ngapy -= siz; 

r ignore MAXGAP when doing endgaps »/ 

siz = (-siz < MAXGAP 1 1 endgaps)? -siz ; MAXGAP; 
il + + ; 

} 

else if (siz > 0) { /* gap in first seq V 
PP[O].n[i0] = siz; 
PPf0].x[i0] - xx; 
gapx++; 
ngapx + = siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
i0+ + ; 

> 

} 

else 

break; 

/* reverse the order of jmps 

for (j = 0, i0~; j < i0; j+ + , i0-) { 

I = PP[0].n[j]; pp[0].nfj] m pp[0J.n[i0]; pp[0].n[i0] = i; 
I = PPl0].xD]; ppI0].x(j] - pp[0].x[i0J; pp[0j.x[i0] = i; 

for(j m 0, il~; j < il; j + + , il-) { 

j = PP[«.n[j]; PP[l].n[j] « pp[l].n[il] ; pp[l].n[il] = i; 

■ - PPUUUk PPtU.xOl = P p[l].x[ilJ; PP [lJ.x[il] = i; 

if(fd > = 0) 

(void) close(fd); 

if(®{ 

(void) unhnk(jname); 
fj =0; 
offset = 0; 

} 



* 
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/* 

*^write a filled jmp struct offset of the prev one (if any): nw() 

writejmp S (ix) writejmps 
int ix; 

{ 

char *mktempO; 
if(!fj){ 

if (mktemp(jname) < 0) { 

rprintf(stderr, "%s: can't mktempO %s\n\ prog, jname); 
cleanup(l); 

} 

if ((fj - fopen(jname, "w")) 0) { 

fprintf(stderr t "%s: can't write %s\n", prog, jname); 
exit(l); 

} 

} 

(void) rwrite((char *)&dx[ix] jp, sizeof(struct jmp). 1, fj); 
(void) rwrite((char *)&dx[ixj. offset, sizeof(dx[ixJ. offset), 1, fj); 



